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The fine print in the upper right-hand 
corner of each page is a draft of intended 
index entries; it won’t appear in the real book. 
Some index entries will be in typewriter type 
and/or preceded by \ or enclosed in {. . .), etc; 
such typographic distinctions aren’t shown here. 
An index entry often extends for several pages; 
the actual scope will be determined later. 
Please note things that should be indexed but aren’t. 
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Preface 


dangerous bend 


G entle Reader: This is a handbook about TgjX, a new typesetting system 
intended for the creation of beautiful books — and especially for books 
that contain a lot of mathematics. By preparing a manuscript in T^X format, 
you will be telling a computer exactly how the manuscript is to be transformed 
into pages whose typographic quality is comparable to that of the world’s finest 
printers; yet you won’t need to do much more work than would be involved if 
you were simply typing the manuscript on an ordinary typewriter. In fact, your 
total work will probably be significantly less, if you consider the time it ordinarily 
takes to revise a typewritten manuscript, since computer text files are so easy 
to change and to reprocess. (If such claims sound too good to be true, keep in 
mind that they were made by Tf^X’s designer, on a day when TgX happened to 
be working, so the statements may be biased; but read on anyway.) 

This manual is intended for people who have never used TgX before, 
as well as for experienced TgX hackers. In other words, it’s supposed to be a 
panacea that satisfies everybody, at the risk of satisfying nobody. Everything you 
need to know about T^X is explained here somewhere, and so are a lot of things 
that most users don’t care about. If you are preparing a simple manuscript, you 
won’t need to learn much about T^X at all; on the other hand, some things that 
go into the printing of technical books are inherently difficult, and if you wish to 
achieve more complex effects you will want to penetrate some of TgX’s darker 
corners. In order to make it possible for many types of users to read this manual 
effectively, a special sign is used to designate material that is for wizards only: 
When the symbol 



appears at the beginning of a paragraph, it warns of a “dangerous bend” in 
the train of thought; don’t read the paragraph unless you need to. Brave and 
experienced drivers at the controls of TgX will gradually enter more and more 
of these hazardous areas, but for most applications the details won’t matter. 

All that you really ought to know, before reading on, is how to get a 
file of text into your computer using a standard editing program. This manual 
explains what that file ought to look like so that TgX will understand it, but basic 
computer usage is not explained here. Some previous experience with technical 
typing will be quite helpful if you plan to do heavily mathematical work with 
TgX, although it is not absolutely necessary. TgX will do most of the necessary 
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JOKES 

truth 


formatting of equations automatically; but users with more experience will be 
able to obtain better results, since there are so many ways to deal with formulas. 
Some of the paragraphs in this manual are so esoteric that they are rated 


everything that was said about single dangerous-bend signs goes double for these. 
You should probably have at least a month’s experience with TgX before you 
attempt to fathom such doubly dangerous depths of the system; in fact, most 
people will never need to know TgX in this much detail, even if they use it every 
day. After all, it’s possible to drive a car without knowing how the engine works. 
Yet the whole story is here in case you’re curious. (About TgX, not cars.) 

The reason for such different levels of complexity is that people change 
as they grow accustomed to any powerful tool. When you first try to use TpX , 
you’ll find that some parts of it are very easy, while other things will take some 
getting used to. A day or so later, after you have successfully typeset a few 
pages, you’ll be a different person; the concepts that used to bother you will now 
seem natural, and you’ll be able to picture the final result in your mind before it 
comes out of the machine. But you’ll probably run into challenges of a different 
kind. After another week your perspective will change again, and you’ll grow in 
yet another way; and so on. As years go by, you might become involved with 
many different kinds of typesetting; and you’ll find that your usage of TpX will 
keep changing as your experience builds. That’s the way it is with any powerful 
tool: There’s always more to learn, and there are always better ways to do what 
you’ve done before. At every stage in the development you’ll want a slightly 
different sort of manual. You may even want to write one yourself. By paying 
attention to the dangerous bend signs in this book you’ll be better able to focus 
on the level that interests you at a particular time. 

Computer system manuals usually make dull reading, but take heart: 
This one contains JOKES every once in a while, so you might actually enjoy 
reading it. (However, most of the jokes can only be appreciated properly if you 
understand a technical point that is being made — so read carefully .) 

Another noteworthy characteristic of this manual is that it doesn’t al- 
ways tell the truth. When certain concepts of TpX are introduced informally, 
general rules will be stated; afterwards you will find that the rules aren’t strictly 
true. In general, the later chapters contain more reliable information than the 
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earlier ones do. The author feels that this technique of deliberate lying will ac- 
tually make it easier for you to learn the ideas. Once you understand a simple 
but false rule, it will not be hard to supplement that rule with its exceptions. 

In order to help you internalize what you’re reading, EXERCISES are 
sprinkled through this manual. It is generally intended that every reader should 
try every exercise, except for questions that appear in the “dangerous bend” 
areas. If you can’t solve a problem, you can always look up the answer. But 
please, try first to solve it by yourself; then you’ll learn more and you’ll learn 
faster. Furthermore, if you think you do know the solution, you should turn to 
Appendix A and check it out, just to make sure. 

The TgX language described in this book is similar to the author’s first 
attempt at a document formatting language, but the new system differs from 
the old one in literally thousands of details. Both languages have been called 
TgX; but henceforth the old language should be called TgX78, and its use should 
rapidly fade away. Let’s keep the name TgX for the language described here, 
since it is so much better, and since it is not going to change any more. 

I wish to thank the hundreds of people who have helped me to formulate 
this “definitive edition” of the TgX language, based on their experiences with 
preliminary versions of the system. My work at Stanford has been generously 
supported by the National Science Foundation, the Office of Naval Research, the 
IBM Corporation, and the System Development Foundation. I also wish to thank 
the American Mathematical Society for its encouragement, for establishing the 
TgX Users Group, and for publishing the TUGboat newsletter (see Appendix J). 

Stanford, California — D. E. K. 

June 1983 


‘Tis pleasant, sure, to see one's name in print; 
A book's a book, although there's nothing in 't. 
— BYRON, English Bards and Scotch Reviewers (1809) 

A question arose as to whether we were covering the field 
that it was intended we should fill with this manual. 
— RICHARD R. DONNELLEY, Proceedings, United Typothetae of America (1897) 
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Chapter 1: The Name of the Game 


English words like ‘technology’ stem from a Greek root beginning with the letters 
rex ■ ■ ■ ; and this same Greek word means art, as well as technology. Hence the 
name TgjX, which is an uppercase form of rey. 

Insiders pronounce the x of TeX as a Greek chi, not as an ‘x’, so that 
TgX rhymes with the word blecchhh. It’s the ‘ch’ sound in Scottish words like 
loch or German words like a ch; it’s a Spanish ‘j’ and a Russian ‘ldi’. When you 
say it correctly to your computer, the terminal may become slightly moist. 

The purpose of this pronunciation exercise is to remind you that TgX 
is primarily concerned with high-quality technical manuscripts: Its emphasis is 
on art and technology, as in the underlying Greek word. If you merely want to 
produce a passably good document — something acceptable and basically read- 
able but not really beautiful — a simpler system will usually suffice. With TgX 
the goal is to produce the finest quality; this requires more attention to detail, 
but you will not find it much harder to go the extra distance, and you’ll be able 
to take special pride in the finished product. 

On the other hand, it’s important to notice another thing about TeX’s 
name: The ‘E’ is out of kilter. This displaced ‘E’ is a reminder that TeX is about 
typesetting, and it distinguishes TeX from other system names. In fact, TEX 
(pronounced tecks ) is the admirable Text Executive processor developed by 
Honeywell Information Systems. Since these two system names are pronounced 
quite differently, they should also be spelled differently. The correct way to refer 
to TeX in a computer file, or when using some other medium that doesn’t allow 
lowering of the ‘E’, is to type ‘TeX’. Then there will be no confusion with similar 
names, and people will be primed to pronounce everything properly. 

► EXERCISE 1.1 

After you have mastered the material in this book, what will you be: A TEXpert, 
or a TEXnician? 


They do certainly give 
very strange and new-fangled names to diseases. 
— PLATO, The Republic, Book 3 (c. 375 B.C.) 

Technique! The very word is like the shriek 
Of outraged Art. It is the idiot name 
Given to effort by those who are too weak, 
Too weary, or too dull to play the game. 
— LEONARD BACON, Sophia Trenton (1920) 


TeX (actually Tj.;X j , meaning of 
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Chapter 2: Book Printing versus Ordinary Typing 


3 


When you first started using a computer terminal, you probably had to adjust 
to the difference between the digit *1’ and the lowercase letter ‘1’. When you 
take the next step to the level of typography that is common in book publishing, 
a few more adjustments of the same kind need to be made; your eyes and your 
fingers need to learn to make a few more distinctions. 

In the first place, there are two kinds of quotation marks in books, 
but only one kind on the typewriter. Even your computer terminal, which has 
more characters than an ordinary typewriter, probably has only a non-oriented 
double-quote mark ("), because the standard ASCII code for computers was not 
invented with book publishing in mind. However, your terminal probably does 
have two flavors of single-quote marks, namely ‘ and ’ ; the second of these is 
useful also as an apostrophe. American keyboards usually contain a left-quote 
character that shows up as something like ' , and an apostrophe or right-quote 
that looks like 1 or '. 

To produce double-quote marks with TgX, you simply type two single- 
quote marks of the appropriate kind. For example, to get the phrase 

“I understand.” 

(including the quotation marks) you should type 

‘ ‘ I understand . ’ ’ 


quotation marks 
ASCII 
apostrophe 
blank space 


to your computer. 

A typewriter-like style of type will be used throughout this manual to 
indicate Tj^X constructions that you might type on your terminal, so that the 
symbols actually typed are readily distinguishable from the output TgX would 
produce and from the comments in the manual itself. Here are the symbols to 
be used in the examples: 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 
abcdef ghi j klmnopqrstuvwxyz 
0123456789"#$°/, &@*+-=, . : ;?! 

()<>[]{}“ 

If your computer terminal doesn’t happen to have all of these, don’t despair; 
Tf’X can make do with the ones you have. An additional symbol 


is used to stand for a blank space, in case it is important to emphasize that a 
blank space is being typed; thus, what you really type in the example above is 

‘ ‘ I u understand . ’ ’ 

Without such a symbol you would have difficulty seeing the invisible parts of 
certain constructions. But we won’t be using ‘ u ’ very often, because spaces are 
usually visible enough. 
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Book printing differs significantly from ordinary typing with respect to 
dashes, hyphens, and minus signs. In good math books, these symbols are all 
different; in fact there usually are at least four different symbols: 

a hyphen (-); 
an en-dash (-); 
an em-dash ( — ); 
a minus sign (— ). 

Hyphens are used for compound words like ‘daughter-in-law’ and ‘X-rated’. En- 
dashes are used for number ranges like ‘pages 13-34’, and also in contexts like 
‘exercise 1.2.6-52’. Em-dashes are used for punctuation in sentences — they are 
what we often call simply dashes. And minus signs are used in formulas. A 
conscientious user of TgX will be careful to distinguish these four usages, and 
here is how to do it: 

for a hyphen, type a hyphen (-); 
for an en-dash, type two hyphens ( — ); 

for an em-dash, type three hyphens ( ); 

for a minus sign, type a hyphen in mathematics mode ($-$). 

(Mathematics mode occurs between dollar signs; it is discussed later, so you 
needn’t worry about it now.) 

► EXERCISE 2.1 

Explain how to type the following sentence to TpX : Alice said, “I always use an 
en-dash instead of a hyphen when specifying page numbers like ‘480-491’ in a 
bibliography.” 

► EXERCISE 2.2 

What do you think happens when you type four hyphens in a row? 

If you look closely at most well-printed books, you will find that certain 
combinations of letters are treated as a unit. For example, this is true of the 
‘f ’ and the ‘i’ of ‘find’. Such combinations are called ligatures , and professional 
typesetters have traditionally been trained to watch for letter combinations such 
as ff, fi, fl, ffi, and ffl. (The reason is that words like ‘find’ don’t look 
very good in most styles of type unless a ligature is substituted for the letters 
that clash. It’s somewhat surprising how often the traditional ligatures appear 
in English; other combinations are important in other languages.) 

► EXERCISE 2.3 

Think of an English word that contains two ligatures. 

The good news is that you do not have to concern yourself with liga- 
tures: TgX is perfectly capable of handling such things by itself, using the same 
mechanism that converts ‘ — ’ into ‘-’. In fact, TpX will also look for combi- 
nations of adjacent letters (like ‘A’ next to ‘V’) that ought to be moved closer 
together for better appearance; this is called kerning. 


dashes 

hyphens 

minus signs 

En-dash 
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bibliography 

ligatures 
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To summarize this chapter: When using Tf'X for straight copy, you type 
the copy as on an ordinary typewriter, except that you need to be careful about 
quotation marks, the number 1, and various kinds of hyphens/dashes. TpX will 
automatically take care of other niceties like ligatures and kerning. 

f (Are you sure you should be reading this paragraph? The “dangerous bend” 
sign here is meant to warn you about material that ought to be skipped on first 
reading. And maybe also on second reading. The reader-beware paragraphs sometimes 
refer to concepts that aren’t explained until later chapters.) 

f lf your keyboard does not contain a left-quote symbol, you can type \lq, 
followed by a space if the next character is a letter, or followed by a \ if the 
next character is a space. Similarly, \rq yields a right-quote character. Is that clear? 

\lq\lq u Iuunderstand . \rq\rq\ u 

f in case you need to type quotes within quotes, for example a single quote 
followed by a double quote, you can’t simply type ’ ’ ’ because TjXX will 
interpret this as ’” (namely, double quote followed by single quote). If you have already 
read Chapter 5, you might expect that the solution will be to use grouping — namely, 
to type something like { ’ } ’ ’ . But it turns out that this doesn’t produce the desired 
result, because there is usually less space following a single right quote than there is 
following a double right quote: What you get is ’”, which is indeed a single quote 
followed by a double quote (if you look at it closely enough), but it looks almost 
like three equally spaced single quotes. On the other hand, you certainly won’t want 
to type because that space is much too large — it’s just as large as the space 

between words — and IjrX might even start a new line at such a space when making up 
a paragraph! The solution is to type ’Xthinspace’ ’, which produces ’” as desired. 

►EXERCISE 2.4 

JL OK, now you know how to produce ”’ and how do you get and “ ! ? 

►EXERCISE 2.5 

JL Why do you think the author introduced the control sequence \thinspace to 
solve the adjacent-quotes problem, instead of recommending the trickier construction 
’$\,$’ ’ (which also works)? 


dangerous bend 

lq 

rq 

quotes within quotes 
thinspace 

SWIFT 

JEVONS 


In modem Wit all printed Trash, is 

Set oft with num'rous Breaks and Dashes — 

— JONATHAN SWIFT, On Poetry: A Rapsody (1733) 

Some compositors still object to work 
in offices where type-composing machines are introduced. 
— WILLIAM STANLEY JEVONS, Political Economy (1878) 
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Your keyboard has very few keys compared to the large number of symbols 
that you may want to specify. In order to make a limited keyboard sufficiently 
versatile, one of the characters that you can type is reserved for special use, 
and it is called the escape character. Whenever you want to type something 
that controls the format of your manuscript, or something that doesn’t use the 
keyboard in the ordinary way, you should type the escape character followed by 
an indication of what you want to do. 

Note: Some computer terminals have a key marked ‘ESC’, but that is not 
your escape character! It is a key that sends a special message to the operating 
system, so don’t confuse it with what this manual calls “escape.” 

TgjX allows any character to be used for escapes, but the “backslash” 
character ‘V is usually adopted for this purpose, since backslashes are reasonably 
convenient to type and they are rarely needed in ordinary text. Things work out 
best when different TgX users do things consistently, so we shall escape via 
backslashes in all the examples of this manual. 

Immediately after typing ‘V (i.e., immediately after an escape character) 
you type a coded command telling TgX what you have in mind. Such commands 
are called control sequences. For example, you might type 

\ input MS 

which (as we will see later) causes TgX to begin reading a file called ‘MS.tex’; 
the string of characters ‘\ input’ is a control sequence. Here’s another example: 

George P Volya and Gabor SzegV'o. 

TgX converts this to ‘George Polya and Gabor Szego.’ There are two control 
sequences, V and \", here; these control sequences have been used to place 
accents over some of the letters. 

Control sequences come in two flavors. The first kind, like \ input, is 
called a control word ; it consists of an escape character followed by one or more 
letters, followed by a space or by something besides a letter. (Tf^X has to know 
where the control sequence ends, so you must put a space after a control word 
if the next character is a letter. For example, if you type ‘\inputMS’, TgX will 
naturally interpret this as a control word with seven letters.) In case you’re 
wondering what a “letter” is, the answer is that T^X normally regards the 52 
symbols A. . .Z and a. . .z as letters. The digits 0. . .9 are not considered to be 
letters, so they don’t appear in control sequences of the first kind. 

A control sequence of the other kind, like \ ’ , is called a control symbol ; 
it consists of the escape character followed by a single nonletter. In this case you 
don’t need a space to separate the control sequence from a letter that follows, 
since control sequences of the second kind always have exactly one symbol after 
the escape character. 

► EXERCISE 3.1 

What are the control sequences in ‘\I’m \exercise3 . 1\\ ! ’? 


escape character 

backslash 

control sequences 

markup commands, see control S€ 

input 

Polya 

Szego 

acute 

umlaut 


accents 
control word 
letter 

control symbol 
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► EXERCISE 3.2 

We’ve seen that the input P\ ’ olya yields ‘Polya’. Can you guess how the French 
words ‘mathematique’ and ‘centimetre’ should be specified? 

When a space comes after a control word (an all-letter control sequence), 
it is ignored by TpX; i.e., it is not considered to be a “real” space belonging to 
the manuscript that is being typeset. But when a space comes after a control 
symbol, it’s truly a space. 

Now the question arises, what do you do if you actually want, a space 
to appear after a control word? We will see later that TpX treats two or more 
consecutive spaces as a single space, so the answer is not going to be “type two 
spaces.” The correct answer is to type “control space,” namely 

\u 

(the escape character followed by a blank space); TpX will treat this as a space 
that is not to be ignored. Notice that control-space is a control sequence of the 
second kind, i.e., a control symbol, since there is a single nonletter ( u ) following 
the escape character. Two consecutive spaces are considered to be equivalent to 
a single space, so further spaces immediately following \ u will be ignored. But if 
you want to enter, say, three consecutive spaces into a manuscript you can type 
‘\u\u\u’- Incidentally, typists are often taught to put two spaces at the ends of 
sentences; but we will see later that TpX has its own way to produce extra space 
in such cases. Thus you needn’t be consistent in the number of spaces you type. 

f Nonprinting control characters like (return) might follow an escape character, 
and these lead to distinct control sequences according to the rules. T^X is 
initially set up to treat \ (return) and \(tab) the same as \ u (control space); these 
special control sequences should probably not be redefined, because you can’t see the 
difference between them when you look at them in a file. 

It is usually unnecessary for you to use “control space,” since control 
sequences aren’t often needed at the ends of words. But here’s an example that 
might shed some light on the matter: This manual itself has been typeset by 
TpX, and one of the things that occurs fairly often is the tricky logo ‘TpX’, which 
requires backspacing and lowering the E. There’s a special control word 

\TeX 

that produces the half-dozen or so instructions necessary to typeset ‘TpX’. When 
a phrase like ‘TpX ignores spaces after control words.’ is desired, the manuscript 
renders it as follows: 

\TeX\ ignores spaces after control words. 

Notice the extra \ following \TeX; this produces the control space that is neces- 
sary because TpX ignores spaces after control words. Without this extra \, the 
result would have been 


space 


jreturn^ 

itabi. 

carriage-return, s< 

logo 

TeX 


j return^ 


TpXignores spaces after control words. 
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On the other hand, you can’t simply put \ after \TeX in all contexts. For 
example, consider the phrase 

the logo ‘ \TeX’ . 

In this case an extra backslash doesn’t work at all; in fact, you get a curious 
result if you type 

the logo ‘ \TeX\ ’ . 

Can you guess what happens? Answer: The \ ’ is a control sequence denoting 
an acute accent, as in our P\’olya example above; the effect is therefore to put 
an accent over the next nonblank character, which happens to be a period. In 
other words, you get an accented period, and the result is 

the logo ‘TgX? 

Computers are good at following instructions, but not at reading your mind. 

Tj^X understands about 900 control sequences as part of its built-in 
vocabulary, and all of them are explained in this manual somewhere. But you 
needn’t worry about learning so many different things, because you won’t really 
be needing very many of them unless you are faced with unusually complicated 
copy. Furthermore, the ones you do need to learn actually fall into relatively 
few categories, so they can be assimilated without great difficulty. For example, 
many of the control sequences are simply the names of special characters used 
in math formulas; you type ‘\pi’ to get ‘7 r’, ‘\Pi’ to get ‘II’, ‘\aleph’ to get ‘N’, 
‘\infty’ to get ‘ 00 ’, ‘\le’ to get “<’, ‘\ge’ to get “>’, ‘\ne’ to get ‘yC, ‘\oplus’ to 
get ‘©’, ‘\otimes’ to get ‘O’. Appendix F contains several tables of such symbols. 



There’s no built-in relationship between uppercase and lowercase letters in 
control sequence names. For example, ‘\pi’ and ‘\Pi’ and API’ and Apl’ are 


four different control words. 


The 900 or so control sequences that were just mentioned actually aren’t 
the whole story, because it’s easy to define more. For example, if you want to 
substitute your own favorite names for math symbols, so that you can remember 
them better, you’re free to go right ahead and do it; Chapter 20 explains how. 

About 300 of T^X's control sequences are called primitive; these are the 
low-level atomic operations that are not decomposable into simpler functions. 
All other control sequences are defined, ultimately, in terms of the primitive 
ones. For example, \input is a primitive operation, but V and \" are not; the 
latter are defined in terms of an \accent primitive. 

People hardly ever use TgX’s primitive control sequences in their man- 
uscripts, because the primitives are . . . well ... so primitive. You have to type 
a lot of instructions when you are trying to make TgX do low-level things; this 
takes time and invites mistakes. It is generally better to make use of higher-level 
control sequences that state what functions are desired, instead of typing out 
the way to achieve each function each time. The higher-level control sequences 


pi 

Pi 

aleph 

infty 

le 

ge 

ne 

oplus 

otimes 

uppercase 

lowercase 

primitive 

input 


accent 
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need to be defined only once in terms of primitives. For example, \TeX is a con- 
trol sequence that means “typeset the TgX logo” ; V is a control sequence that 
means “put an acute accent over the next character”; and both of these con- 
trol sequences might require different combinations of primitives when the style 
of type changes. If TeX’s logo were to change, the author would simply have 
to change one definition, and the changes would appear automatically wherever 
they were needed. By contrast, an enormous amount of work would be necessary 
to change the logo if it were specified as a sequence of primitives each time. 

At a still higher level, there are control sequences that govern the overall 
format of a document. For example, in the present book the author typed 
‘\exercise’ just before stating each exercise; this \exercise command was 
programmed to make TgX do all of the following things: 

■ compute the exercise number (e.g., ‘3.2’ for the second exercise in Chap- 
ter 3); 

■ typeset 1 ►EXERCISE 3.2’ with the appropriate typefaces, on a line by 
itself, and with the triangle sticking out in the left margin; 

■ leave a little extra space just before that line, or begin a new page at 
that line if appropriate; 

■ prohibit beginning a new page just after that line; 

■ suppress indentation on the following line. 

It is obviously advantageous to avoid typing all of these individual instructions 
each time. And since the manual is entirely described in terms of high-level 
control sequences, it could be printed in a radically different format simply by 
changing a dozen or so definitions. 

f How can a person distinguish a IjrX primitive from a control sequence that 
has been defined at a higher level? There are two ways: (1) The index to this 
manual lists all of the control sequences that are discussed, and each primitive is marked 
with an asterisk. (2) You can display the meaning of a control sequence while running 
Tj?X. If you type ‘\show\cs’ where \cs is any control sequence, TfjX will respond 
with its current meaning. For example, ‘\show\input’ results in ‘> \input=\input . ’, 
because \input is primitive. On the other hand, ‘\show\thinspace’ yields 

> \thinspace=macro : 

->\kern . 16667em . 


This means that \thinspace has been defined as an abbreviation for ‘\kern . 16667em ’. 
By typing ‘\show\kern’ you can verify that \kern is primitive. The results of \show 
appear on your terminal and in the log file that you get after running TgX. 



► EXERCISE 3.3 

Which of the control sequences \ u and \ (return) is primitive? 


exercise 
show 
thinspace 
kern 
log file 
plain TgX 

basic control sequences 


In the following chapters we shall frequently discuss “plain TeX” for- 
mat, which is a set of about 600 basic control sequences that are defined in 
Appendix B. These control sequences, together with the 300 or so primitives, 
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are usually present when TpX begins to process a manuscript; that is why TpX 
claims to know roughly 900 control sequences when it starts. We shall see how 
plain TpX can be used to create documents in a flexible format that meets many 
people’s needs, using some typefaces that come with the TpX system. However, 
you should keep in mind that plain TpX is only one of countless formats that 
can be designed on top of TpX’s primitives; if you want some other format, it 
will usually be possible to adapt TpX so that it will handle whatever you have in 
mind. The best way to learn is probably to start with plain TpX and to change 
its definitions, little by little, as you gain more experience. 

f Appendix E contains examples of formats that can be added to Appendix B 
for special applications; for example, there is a set of definitions suitable for 
business correspondence. A complete specification of the format used to typeset this 
manual also appears in Appendix E. Thus, if your goal is to learn how to design JpX 
formats, you will probably want to study Appendix E while mastering Appendix B. 
After you have become skilled in the lore of control-sequence definition, you will prob- 
ably have developed some formats that other people will want to use; you should then 
write a supplement to this manual, explaining your style rules. 


The main point of these remarks, as far as novice TpX users are con- 
cerned, is that it is indeed possible to define nonstandard TpX control sequences. 
When this manual says that something is part of “plain T^jX,” it means that Tp)X 
doesn’t insist on doing things exactly that way; a person could change the rules 
by changing one or more of the definitions in Appendix B. But you can safely 
rely on the control sequences of plain TpX until you become an experienced 
T^Xmcal typist. 

(£)(£)*■ EXERCISE 3.4 

JL JL How many different control sequences of length 2 (including the escape char- 
acter) are possible? How many of length 3? 


formats 

SELDEN 

LINCOLN 


Syllables govern the world. 
— JOHN SELDEN, Table Talk (1689) 

l claim not to have controlled events, 
but confess plainly that events have controlled me. 

— ABRAHAM LINCOLN (1864) 
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Occasionally you will want to change from one typeface to another, for example 
if you wish to be bold or to emphasize something. TpX deals with sets of up 
to 256 characters called “fonts” of type, and control sequences are used to select 
a particular font. For example, you could specify the last few words of the first 
sentence above in the following way, using the plain Tp’X format of Appendix B: 

to be \bf bold \rm or to \sl emphasize \rm something. 

Plain Tp]X provides the following control sequences for changing fonts: 


\rm switches to the normal “roman” typeface: Roman 

\sl switches to a slanted roman typeface: Slanted 

\it switches to italic style: Italic 

\tt switches to a typewriter-like face: Typewriter 

\bf switches to an extended boldface style: Bold 


At the beginning of a run you get roman type (\rm) unless you specify otherwise. 

Notice that two of these faces have an “oblique” slope for emphasis: 
Slanted type is essentially the same as roman, hut the letters are slightly skewed, 
while the letters in italic type are drawn in a different style. (You can perhaps 
best appreciate the difference between the roman and italic styles by contemplat- 
ing letters that are in an unslanted italic face.) Typographic conventions are 
presently in a state of transition, because new technology has made it possible 
to do things that used to be prohibitively expensive; people are wrestling with 
the question of how much to use their new-found typographic freedom. Slanted 
roman type was introduced in the 1930s, but it first became widely used as 
an alternative to the conventional italic during the late 1970s. It can be bene- 
ficial in mathematical texts, since slanted letters are distinguishable from the 
italic letters in math formulas. The double use of italic type for two different 
purposes — for example, when statements of theorems are italicized as well as the 
names of variables in those theorems — has led to some confusion, which can now 
be avoided with slanted type. People are not generally agreed about the relative 
merits of slanted versus italic, but slanted type is rapidly becoming a favorite 
for the titles of books and journals in bibliographies. 

Special fonts are effective for emphasis, but not for sustained reading; 
your eyes would tire if long portions of this manual were entirely set in a bold 
or slanted or italic face. Therefore roman type accounts for the bulk of most 
typeset material. But it’s a nuisance to sav ‘\rm’ every time you want to go 
back to the roman style, so TgX provides an easier way to do it, using “curly 
brace” symbols: You can switch fonts inside the special symbols { and }, without 
affecting the fonts outside. For example, the displayed phrase at the beginning 
of this chapter is usually rendered 

to be {\bf bold} or to {\sl emphasize} something. 

This is a special case of the general idea of “grouping” that we shall discuss in 
the next chapter. It’s best to forget about the first way of changing fonts, and 


typeface 

bold 

fonts 

rm 

si 

it 

tt 

bf 

typewriter type 
face 

roman type 
oblique 
Slanted type 
italic type 
curly brace 
brace 
grouping 
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to use grouping instead; then your TgX manuscripts will look more natural, and 
you’ll probably never* have to type ‘\rm’. 

► EXERCISE 4.1 

Explain how to type the bibliographic reference ‘Ulrich Dieter, Journal fur die 
reine und angewandte Mathematik 201 (1959), 37-70.’ [Use grouping.] 

We have glossed over an important aspect of quality in the preceding 
discussion. Look, for example, at the italicized and slanted words in this sentence. 
Since italic and slanted styles slope to the right, the d’s stick into the spaces that 
separate these words from the roman type that follows; as a result, the spaces 
appear to be too skimpy, although they are correct at the base of the letters. 
To equalize the effective white space, Tj^X allows you to put the special control 
sequence ‘\/’ just before switching back to unslanted letters. When you type 

{\it italicized\/} and {\sl slanted\/} words 


you get italicized and slanted words that look better. The ‘\/’ tells T^X to add an 
“italic correction” to the previous letter, depending on that letter; this correction 
is about four times as much for an ‘/’ as for a ‘c’, in a typical italic font. 

Sometimes the italic correction is not desirable, because other factors 
take up the visual slack. The standard rule of thumb is to use \/ just before 
switching from slanted or italic to roman or bold, unless the next character is a 
period or comma. For example, type 

{\it italics\/} for {\it emphasis}. 

Old manuals of style say that the punctuation after a word should be in the same 
font as that word; but an italic semicolon often looks wrong, so this convention 
is changing. When an italicized word occurs just before a semicolon, the author 
recommends typing ‘{\it word\/};’. 


► EXERCISE 4.2 

Explain how to typeset a roman word in the midst of an italicized sentence. 


f Every letter of every font has an italic correction, which you can bring to life 
by typing \/. The correction is usually zero in unslanted styles, but there are 
exceptions: To typeset a bold ‘f ’ in quotes, you should say a bold ‘{\bf f\/}’, lest 
you get a bold ‘f . 


/>)/>>► EXERCISE 4.3 

JL JL Define a control sequence \ic such that ‘\ic c’ puts the italic correction of 
character c into TpX's register \dimen0. 



The primitive control sequence \nullfont stands for a font that has no char- 
acters. This font is always present, in case you haven’t specified any others. 


Dieter 

/ 

italic correction 

punctuation 

nullfont 


* Well . . . , hardly ever. 
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Fonts vary in size as well as in shape. For example, the font you are 
now reading is called a “10-point” font, because certain features of its design are 
10 points apart, when measured in printers’ units. (We will study the point 
system later; for now, it should suffice to point out that the parentheses around 
this sentence are exactly 10 points tall — and the em-dash is just 10 points wide.) 
The “dangerous bend” sections of this manual are set in 9-point type, the foot- 
notes in 8-point, subscripts in 7-point or 6-point, sub-subscripts in 5-point. 

Each font used in a T^X manuscript is associated with a control se- 
quence; for example, the 10-point font in this paragraph is called \tenrm, and 
the corresponding 9-point font is called \ninerm. The slanted fonts that match 
\tenrm and \ninerm are called \tensl and \ninesl. These control sequences 
are not built into TgX, nor are they the actual names of the fonts; TgX users are 
just supposed to make up convenient names, whenever new fonts are introduced 
into a manuscript. Such control sequences are used to change typefaces. 

When fonts of different sizes are used simultaneously, Tf’X will line the 
letters up according to their “baselines.” For example, if you type 

\tenrm smaller \ninerm and smaller 
\eightrm and smaller \sevenrm and smaller 
\sixrm and smaller \fiverm and smaller \tenrm 

the result is smaller and smaller and smaller and smaller and smaller and smaller. Of course 
this is something that authors and readers aren’t accustomed to, because printers 
couldn’t do such things with traditional lead types. Perhaps poets who wish 
to speak in a still small voice will cause future books to make use of frequent font 
variations, but nowadays it’s only an occasional font freak (like the author of this manual) 
who likes such experiments. One should not get too carried away by the prospect 
of font switching unless there is good reason. 

An alert reader might well be confused at this point because we started 
out this chapter by saying that ‘\rm’ is the command that switches to roman 
type, but later on we said that ‘\tenrm’ is the way to do it. The truth is that 
both ways work. But it has become customary to set things up so that \rm means 
“switch to roman type in the current size” while \tenrm means “switch to roman 
type in the 10-point size.” In plain TgX format, nothing but 10-point fonts are 
provided, so \rm will always get you \tenrm; but in more complicated formats the 
meaning of \rm will change in different parts of the manuscript. For example, in 
the format used by the author to typeset this manual, there’s a control sequence 
‘\tenpoint’ that causes \rm to mean \tenrm, \sl to mean \tensl, and so on, 
while ‘\ninepoint’ changes the definitions so that \rm means \ninerm, etc. 
There’s another control sequence used to introduce the quotations at the end of 
each chapter; when the quotations are typed, \rm and \sl temporarily stand for 
8-point unslanted sans-serif type and 8-point slanted sans-serif type , respectively. 
This device of constantly redefining the abbreviations \rm and \sl, behind the 
scenes, frees the typist from the need to remember what size or style of type is 
currently being used. 


points 

dangerous bend 

subscripts 
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ninesl 
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tenpoint 

ninepoint 
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► EXERCISE 4.4 

Why do you think the author chose the names ‘\tenpoint’ and ‘\tenrm’, etc., 
instead of ‘\10point’ and ‘\10rm’? 


/£> ►EXERCISE 4.5 

JL Suppose that you have typed a manuscript using slanted type for emphasis, 
but your editor suddenly tells you to change all the slanted to italic. What’s an easy 
way to do this? 


f Eacli font has an external name that identifies it with respect to all other fonts 
in a particular library. For example, the font in this sentence is called ‘cmr9’, 
which is an abbreviation for “Computer Modern Roman 9 point.” In order to prepare 
TgX for using this font, the command 


\font\ninerm=cmr9 


appears in Appendix E. In general you say ‘\font\cs={external font name)’ to load 
the information about a particular font into TfrX’s memory; afterwards the control 
sequence \cs will select that font for typesetting. Plain TgX makes only sixteen fonts 
available initially (see Appendix B and Appendix F), but you can use \font to access 
anything that exists in your system’s font library. 

f it is often possible to use a font at several different sizes, by magnifying or 
shrinking the character images. Each font has a so-called design size, which 
reflects the size it normally has by default; for example, the design size of cmr9 is 
9 points. But on many systems there is also a range of sizes at which you can use 
a particular font, by scaling its dimensions up or down. To load a scaled font into 
TJrX’s memory, you simply say ‘\f ont\cs=(external font name) at (desired size)’. For 
example, the command 


\font\magnif iedf iverm=cmr5 at lOpt 

brings in 5-point Computer Modern Roman at twice its normal size. (Caution: Before 
using this ‘at’ feature, you should check to make sure that your typesetter supports 
the font at the size in question; TgX will accept any (desired size) that is positive and 
less than 2048 points, but the final output will not be right unless the scaled font really 
is available on your printing device.) 

f What’s the difference between cmr5 at lOpt and the normal 10-point font, 
cmrlO? Plenty; a well-designed font will be drawn differently at different point 
sizes, and the letters will often have different relative heights and widths, in order to 
enhance readability. 

Ten point type is different from magnified five-point type. 


It is usually best to scale fonts only slightly with respect to their design size, unless 
the final product is going to be photographically reduced after TpX has finished with 
it, or unless you are trying for an unusual effect. 



Another way to magnify a font is to specify a scale factor that is relative to 
the design size. For example, the command 


Computer Modern 

cm fonts 

font 

design size 
at 

magnification 

reduction 


\f ont\magnif iedf iverm=cmr5 scaled 2000 
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is another way to bring in the font cmr5 at double size. The scale factor is specified 
as an integer that represents a magnification ratio times 1000. Thus, a scale factor of 
1200 specifies magnification by 1.2, etc. 

► EXERCISE 4.6 

State two ways to load font cmrlO into TjgX’s memory at half its normal size. 

At many computer centers it has proved convenient to supply fonts at magni- 
fications that grow in geometric ratios — something like well-tempered tuning 
on a piano. The idea is to have all fonts available at their true size as well as at 
magnifications 1.2 and 1.44 (which is 1.2 x 1.2); perhaps also at magnification 1.728 
(= 1.2 x 1.2 x 1.2) and even higher. Then you can magnify an entire document by 1.2 
or 1.44 and still stay within the set of available fonts. Plain T)gX provides the abbre- 
viations \magstepO for a scale factor of 1000, \magstepl for a scaled factor of 1200, 
\magstep2 for 1440, and so on up to \magstep5. You say, for example, 

\font\bigtenrm=cmrlO scaled\magstep2 
to load font cmrlO at 1.2 x 1.2 times its normal size. 

“This is cmrlO at normal size (\magstepO).” 

“This is cmrlO scaled once by 1.2 (\magstepl).” 

“This is cmrlO scaled twice by 1.2 (\magstep2).” 

(Notice that a little magnification goes a long way.) There’s also \magstephalf , which 
magnifies by \/L2, i.e., halfway between steps 0 and 1. 

f Chapter 10 explains how to apply magnification to an entire document, over 
and above any magnification that has been specified when fonts are loaded. 
For example, if you have loaded a font that is scaled by \magstepl and if you also 
specify \magnif ication=\magstep2, the actual font used for printing will be scaled by 
\magstep3. Similarly, if you load a font scaled by \magstephalf and if you also say 
\magnif ication=\magstephalf , the printed results will be scaled by \magstepl. 


cmr5 

piano 

magstep 

magstephalf 

magnification 

LEE 

SHAKESPEARE 


Type faces — like people’s faces — have distinctive features 
indicating aspects of character. 
— MARSHALL LEE, Bookmaking (1965) 

This was the Noblest Roman of them all. 
— WILLIAM SHAKESPEARE, The Tragedie of Julius Caesar (1599) 
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Every once in a while it is necessary to treat part of a manuscript as a unit, so you 
need to indicate somehow where that part begins and where it ends. For this 
purpose T^X gives special interpretation to two “grouping characters,” which 
(like the escape character) are treated differently from the normal symbols that 
you type. We assume in this manual that { and } are the grouping characters, 
since they are the ones used in plain TgX. 

We saw examples of grouping in the previous chapter, where it was men- 
tioned that font changes inside a group do not affect the fonts in force outside. 
The same principle applies to almost anything else that is defined inside a group, 
as we will see later; for example, if you define a control sequence within some 
group, that definition will disappear when the group ends. In this way you 
can conveniently instruct TgX to do something unusual, by changing its normal 
conventions temporarily inside of a group; since the changes are invisible from 
outside the group, there is no need to worry about messing up the rest of a 
manuscript by forgetting to restore the normal conventions when the unusual 
construction has been finished. Computer scientists have a name for this aspect 
of grouping, because it’s an important aspect of programming languages in gen- 
eral; they call it “block structure,” and definitions that are in force only within 
a group are said to be “local” to that group. 

You might want to use grouping even when you don’t care about block 
structure, just to have better control over spacing. For example, let’s consider 
once more the control sequence \TeX that produces the logo ‘TgX’ in this manual: 
We observed in Chapter 3 that a blank space after this control sequence will be 
gobbled up unless one types ‘\TeX\ ’, yet it is a mistake to say ‘\TeX\’ when the 
following character is not a blank space. Well, in all cases it would be correct to 
specify the simple group 

{\TeX> 

whether or not the following character is a space, because the } stops TgX from 
absorbing an optional space into \TeX. This might come in handy when you’re 
using a text editor (e.g., when replacing all occurrences of a particular word by 
a control sequence). Another thing you could do is type 

\TeXO 

using an empty group for the same purpose: The ‘O’ here is a group of no 
characters, so it produces no output, but it does have the effect of stopping Ty^X 
from skipping blanks. 

► EXERCISE 5.1 

Sometimes you run into a rare word like ‘shelfful’ that looks better as ‘shelfful’ 
without the ‘ff’ ligature. How can you fool Tj;X into thinking that there aren’t 
two consecutive f’s in such a word? 



grouping characters 

curly braces, see braces 

block structure 

local 

TeX 

space 

empty group 
lbrace rbrace 
ligature 
control space 


► EXERCISE 5.2 

Explain how to get three blank spaces in a row without using ‘\ u ’. 
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But T^jX also uses grouping for another, quite different, purpose, namely 
to determine how much of your text is to be governed by certain control se- 
quences. For example, if you want to center something on a line you can type 

\centerline{This information should be centered.} 

using the control sequence \centerline defined in plain Tf^X format. 

Grouping is used in quite a few of T^X's more intricate instructions; 
and it’s possible to have groups within groups within groups, as you can see by 
glancing at Appendix B. Complex grouping is generally unnecessary, however, 
in ordinary manuscripts, so you needn’t worry about it. Just don’t forget to 
finish each group that you’ve started, because a lost “}’ might cause trouble. 
Here’s an example of two groups, one nested inside the other: 

\centerline{This information should be {\it centered}.} 

As you might expect, T^X will produce a centered line that also contains italics: 

This information should be centered. 

But let’s look at the example more closely: ‘\centerline’ appears outside the 
curly braces, while ‘\it ’ appears inside. Why are the two cases different? And 
how can a beginner learn to remember which is which? Answer: \centerline 
is a control sequence that applies only to the very next thing that follows, so 
you want to put braces around the text that is to be centered (unless that text 
consists of a single symbol or control sequence). For example, to center the T^X 
logo on a line, it would suffice to type ‘\centerline\TeX’, but to center the 
phrase ‘TgX has groups’ you need braces: ‘\centerline{\TeX\ has groups}’. 
On the other hand, \it is a control sequence that simply means “change the 
current font” ; it acts without looking ahead, so it affects everything that follows, 
at least potentially. The braces surround \it in order to confine the font change 
to a local region. 

In other words, the two sets of braces in this example actually have 
different functions: One serves to treat several words of the text as if they were 
a single object, while the other provides local block structure. 

► EXERCISE 5.3 

What do you think happens if you type the following: 

\centerline{This information should be {centered}.} 
\centerline So should this. 

► EXERCISE 5.4 

And how about this one? 

\centerline{This information should be \it centered.} 
►EXERCISE 5.5 

JL Define a control sequence \ital so that a user could type l \ital{text}’ in- 
stead of ‘{\it text\/}’. Discuss the pros and cons of \ital versus \it. 


centerline 

nested 
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/>)/>) Subsequent chapters describe many primitive operations of TjrjX for which 
JL JL the locality of grouping is important. For example, if one of Tj?X’s internal 
parameters is changed within a group, the previous contents of that parameter will 
be restored when the group ends. Sometimes, however, it's desirable to make a def- 
inition that transcends its current group. This effect can be obtained by prefixing 
‘\global’ to the definition. For example, TpX keeps the current page number in a 
register called \countO, and the routine that outputs a page wants to increase the page 
number. Output routines are always protected by enclosing them in groups, so that 
they do not inadvertently mess up the rest of lj?X; but the change to \countO would 
disappear if it were kept local to the output group. The command 

\global\advance\countO by 1 

solves the problem; it increases \countO and makes this value stick around at the end 
of the output routine. In general, \global makes the immediately following definition 
pertain to all existing groups, not just to the innermost one. 

<§)(£)*■ EXERCISE 5.6 

JL JL If you think you understand local and global definitions, here’s a little test to 
make sure: Suppose \c stands for ‘\countl=’, \g stands for ‘\global\countl=’, and \s 
stands for ‘\showthe\countl’. What values will be shown? 

{\cl\s\g2{\s\c3\s\g4\s\c5\s}\s\c6\s}\s 

/gb/gb Another way to obtain block structure with TpX is to use the primitives 
JL JL \begingroup and \endgroup. These control sequences make it easy to be- 
gin a group within one control sequence and end it within another. The text that Tj?X 
actually executes, after control sequences have been expanded, must have properly 
nested groups, i.e., groups that don’t overlap. For example, 

{ \begingroup } \endgroup 

is not legitimate. 

(£)(£)*■ EXERCISE 5.7 

JL JL Define control sequences \beginthe (block name) and \endthe(block name) 
that provide a “named” block structure. In other words, 

\beginthe{beguine}\beginthe{waltz}\endthe{waltz}\endthe{beguine} 

should be permissible, but not 

\beginthe{beguine}\beginthe{waltz}\endthe{beguine}\endthe{waltz} . 


global 

page number 
Output routines 
advance 
begingroup 
endgroup 
nested groups 
MUIRHEAD 
HOWARD 


I have had recourse to varieties of type, 
and to braces. 

— JAMES MUIRHEAD, The Institutes of Gaius (1880) 

An encounter group is a gathering, for a few hours or a few days, 
of twelve or eighteen personable, responsible, certifiably normal 

and temporarily smelly people. 
— JANE HOWARD, Please Touch (1970) 
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The best way to learn how to use T^X is to use it. Thus, it’s high time for you 
to sit down at a computer terminal and interact with the Tf^X system, trying 
things out to see what happens. Here are some small but complete examples 
suggested for your first encounter. 

Caution: This chapter is rather a long one. Why don’t you stop reading 
now, and come back fresh tomorrow? 

OK, let’s suppose that you’re rested and excited about having a trial run 
of TgX. Step-by-step instructions for using it appear in this chapter. First do 
this: Go to the lab where the graphic output device is, since you will be wanting 
to see the output that you get — it won’t really be satisfactory to run TgX from 
a remote location, where you can’t hold the generated documents in your own 
hands. Then log in; and start TgX. (You may have to ask somebody how to 
do this on your local computer. Usually the operating system prompts you for 
a command and you type ‘TeX’ or ‘run tex’ or something like that.) 

When you’re successful, TpX will welcome you with a message such as 

This is TeX, Version 3.14 (preloaded format=plain 89.7.15) 

** 

The ‘**’ is TgX’s way of asking you for an input file name. 

Now type ‘\relax’ (including the backslash), and (return) (or whatever 
is used to mean “end-of-line” on your terminal). T^X is all geared up for action, 
ready to read a long manuscript; but you’re saying that it’s all right to take 
things easy, since this is going to be a real simple run. In fact, \relax is a 
control sequence that means “do nothing.” 

The machine will type another asterisk at you. This time type something 
like ‘Hello?’ and wait for another asterisk. Finally type ‘\end’, and stand back 
to see what happens. 

TgjX should respond with ‘ [1] ’ (meaning that it has finished page 1 of 
your output); then the program will halt, probably with some indication that 
it has created a file called ‘texput.dvi’. (TgX uses the name texput for its 
output when you haven’t specified any better name in your first line of input; 
and dvi stands for “device independent,” since texput.dvi is capable of being 
printed on almost any kind of typographic output device.) 

Now you’re going to need some help again from your friendly local com- 
puter hackers. They will tell you how to produce hardcopy from texput.dvi. 
And when you see the hardcopy — Oh, glorious day! — you will see a magnificent 
‘Hello?’ and the page number T’ at the bottom. Congratulations on your first 
masterpiece of fine printing. 

The point is, you understand now how to get something through the 
whole cycle. It only remains to do the same thing with a somewhat longer 
document. So our next experiment will be to work from a file instead of typing 
the input online. 


Running the program 
** 

relax 

return 

asterisk 

end 

lb 

texput 
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Use your favorite text editor to create a file called story.tex that con- 
tains the following 18 lines of text (no more, no less): 

1 \hrule 

2 \vskip lin 

3 \centerline{\bf A SHORT STORY} 

4 \vskip 6pt 

5 \centerline{\sl by A. U. Thor} 

6 \vskip .5cm 

7 Once upon a time, in a distant 

8 galaxy called \"0\"o\c c, 

9 there lived a computer 

10 named R.~J. Drofnats. 

11 

12 Mr. "Drofnats or ‘ ‘R. J.,’’ as 

13 he preferred to be called 

li was happiest when he was at work 

is typesetting beautiful documents. 

16 \vskip lin 

17 \hrule 

18 \vfill\eject 

(Don’t type the numbers at the left of these lines, of course; they are present only 
for reference.) This example is a bit long, and more than a bit silly; but it’s no 
trick for a good typist like you and it will give you some worthwhile experience, 
so do it. For your own good. And think about what you’re typing, as you go; 
the example introduces a few important features of T^X that you can learn as 
you’re making the file. 

Here is a brief explanation of what you have just typed: Lines 1 and 17 
put a horizontal rule (a thin line) across the page. Lines 2 and 16 skip past one 
inch of space; ‘\vskip’ means “vertical skip,” and this extra space will separate 
the horizontal rules from the rest of the copy. Lines 3 and 5 produce the title and 
the author name, centered, in boldface and in slanted type. Lines 4 and 6 put 
extra white space between those lines and their successors. (We shall discuss 
units of measure like ‘6pt’ and ‘.5cm’ in Chapter 10.) 

The main bulk of the story appears on lines 7-15, and it consists of 
two paragraphs. The fact that line 11 is blank informs T}]X that line 10 is the 
end of the first paragraph; and the ‘\vskip’ on line 16 implies that the second 
paragraph ends on line 15, because vertical skips don’t appear in paragraphs. 
Incidentally, this example seems to be quite full of TgX commands; but it is 
atypical in that respect, because it is so short and because it is supposed to 
be teaching things. Messy constructions like \vskip and \centerline can be 
expected at the very beginning of a manuscript, unless you’re using a canned 
format, but they don’t last long; most of the time you will find yourself typing 
straight text, with relatively few control sequences. 
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And now comes the good news, if you haven’t used computer typesetting 
before: You don’t have to worry about where to break lines in a paragraph (i.e. , 
where to stop at the right margin and to begin a new line), because TgX will 
do that for you. Your manuscript file can contain long lines or short lines, or 
both; it doesn’t matter. This is especially helpful when you make changes, since 
you don’t have to retype anything except the words that changed. Every time 
you begin a new line in your manuscript file it is essentially the same as typing 
a space. When TgX has read an entire paragraph — in this case lines 7 to 11 — it 
will try to break up the text so that each line of output, except the last, contains 
about the same amount of copy; and it will hyphenate words if necessary to keep 
the spacing consistent, but only as a last resort. 

Line 8 contains the strange concoction 

\"0\"o\c c 

and you already know that \" stands for an umlaut accent. The \c stands for a 
“cedilla,” so you will get 'Ode;’ as the name of that distant galaxy. 

The remaining text is simply a review of the conventions that we dis- 
cussed long ago for dashes and quotation marks, except that the signs in 
lines 10 and 12 are a new wrinkle. These are called ties , because they tie words 
together; i.e., TgjX is supposed to treat as a normal space but not to break 
between lines there. A good typist will use ties within names, as shown in our 
example; further discussion of ties appears in Chapter 14. 

Finally, line 18 tells I^X to ‘\vf ill’, i.e., to fill the rest of the page with 
white space; and to ‘\eject’ the page, i.e., to send it to the output file. 

Now you’re ready for Experiment 2: Get TgX going again. This time 
when the machine says ‘**’ you should answer ‘story’, since that is the name 
of the file where your input resides. (The file could also be called by its full 
name ‘story.tex’, but T^X automatically supplies the suffix ‘.tex’ if no suffix 
has been specified.) 

You might wonder why the first prompt was ‘**’, while the subsequent 
ones are ‘*’; the reason is simply that the first thing you type to Tj^X is slightly 
different from the rest: If the first character of your response to ‘**’ is not a 
backslash, TgX automatically inserts ‘\input’. Thus you can usually run TgX 
by merely naming your input file. (Previous Tf^X systems required you to start 
by typing ‘\input story’ instead of ‘story’, and you can still do that; but most 
TgX users prefer to put all of their commands into a file instead of typing them 
online, so T^X now spares them the nuisance of starting out with \ input each 
time.) Recall that in Experiment 1 you typed ‘\relax’; that started with a 
backslash, so \ input was not implied. 

f There’s actually another difference between ‘**’ and If the first character 
after ** is an ampersand ( ‘ft’ ), T^X will replace its memory with a precom- 
puted format file before proceeding. Thus, for example, you can type ‘ftplain \input 
story’ or even ‘&plain story’ in response to if you are running some version of 
T^rX that might not have the plain format preloaded. 
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f Incidentally, many systems allow you to invoke TjgX by typing a one-liner like 
‘tex story’ instead of waiting for the similarly, ‘tex \relax’ works for 
Experiment 1, and ‘tex ftplain story’ loads the plain format before inputting the 
story file. You might want to try this, to see if it works on your computer, or you 
might ask somebody if there’s a similar shortcut. 

As TeX begins to read your story file, it types ‘(story.tex’, possibly 
with a version number for more precise identification, depending on your local 
operating system. Then it types ‘[1]’, meaning that page 1 is done; and ‘)’, 
meaning that the file has been entirely input. 

Tj;X will now prompt you with V, because the file did not contain 
‘\end’. Enter \end into the computer now, and you should get a file story. dvi 
containing a typeset version of Thor’s story. As in Experiment 1, you can proceed 
to convert story. dvi into hardcopy; go ahead and do that now. The typeset 
output won’t be shown here, but you can see the results by doing the experiment 
personally. Please do so before reading on. 

► EXERCISE 6.1 

Statistics show that only 7.43 of 10 people who read this manual actually type 
the story.tex file as recommended, but that those people learn TgX best. So 
why don’t you join them? 

► EXERCISE 6.2 

Look closely at the output of Experiment 2, and compare it to story . tex : If you 
followed the instructions carefully, you will notice a typographical error. What 
is it, and why did it sneak in? 

With Experiment 2 under your belt, you know how to make a document 
from a file. The remaining experiments in this chapter are intended to help 
you cope with the inevitable anomalies that you will run into later; we will 
intentionally do things that will cause TeX to “squeak.” 

But before going on, it’s best to fix the error revealed by the previous 
output (see exercise 6.2): Line 13 of the story.tex file should be changed to 

he preferred to be called °/, error has been fixed! 

The sign here is a feature of plain Tp;X that we haven’t discussed before: It ef- 
fectively terminates a line of your input file, without introducing the blank space 
that TeX ordinarily inserts when moving to the next line of input. Furthermore, 
TeX ignores everything that you type following a °/„, up to the end of that line 
in the file; you can therefore put comments into your manuscript, knowing that 
the comments are for your eyes only. 

Experiment 3 wfill be to make TeX work harder, by asking it to set 
the story in narrower and narrower columns. Here’s how: After starting the 
program, type 


end 

percent 

comments 


\hsize=4in \input story 
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in response to the This means, “Set the story in a 4-inch column.” More 
precisely, \hsize is a primitive of TpX that specifies the horizontal size, i.e., the 
width of each normal line in the output when a paragraph is being typeset; and 
\input is a primitive that causes T^X to read the specified file. Thus, you are 
instructing the machine to change the normal setting of \hsize that was defined 
by plain TgX, and then to process story.tex under this modification. 

TgX should respond by typing something like ‘(story.tex [1])’ as 
before, followed by Now you should type 

\hsize=3in \input story 

and, after TgX says ‘(story.tex [2] )’ asking for more, type three more lines 

\hsize=2.5in \input story 
\hsize=2in \input story 
\end 


hsize 

input 

overfull box 


to complete this four-page experiment. 

Don’t be alarmed when TgX screams ‘Overfull \hbox’ several times 
as it works at the 2-inch size; that’s what was supposed to go wrong during 
Experiment 3. There simply is no good way to break the given paragraphs into 
lines that are exactly two inches wide, without making the spaces between words 
come out too large or too small. Plain Tp]X has been set up to ensure rather 
strict tolerances on all of the lines it produces: 

you don’t get spaces between words narrower than this, and 
you don’t get spaces between words wider than this. 

If there’s no way to meet these restrictions, you get an overfull box. And with 
the overfull box you also get (1) a warning message, printed on your terminal, 
and (2) a big black bar inserted at the right of the offending box, in your output. 
(Look at page 4 of the output from Experiment 3; the overfull boxes should stick 
out like sore thumbs. On the other hand, pages 1-3 should be perfect.) 

Of course you don’t want overfull boxes in your output, so T^X provides 
several ways to remove them; that will be the subject of our Experiment 4. But 
first let’s look more closely at the results of Experiment 3, since TgX reported 
some potentially valuable information when it was forced to make those boxes 
too full; you should learn how to read this data: 

Overfull \hbox (0.98807pt too wide) in paragraph at lines 7 — 11 
\tenrm tant galaxy called []0~~?o~~Xc, there lived | 

Overfull \hbox (0.4325pt too wide) in paragraph at lines 7 — 11 
\tenrm a com-puter named R. J. Drof-nats. | 

Overfull \hbox (5.32132pt too wide) in paragraph at lines 12 — 16 
\tenrm he pre-f erred to be called was hap- | 


Each overfull box is correlated with its location in your input file (e.g., the first 
two were generated when processing the paragraph on lines 7-11 of story.tex), 
and you also learn by how much the copy sticks out (e.g., 0.98807 points). 
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Notice that TgX also shows the contents of the overfull boxes in ab- 
breviated form. For example, the last one has the words ‘he preferred to be 
called — was hap-’, set in font \tenrm (10-point roman type); the first one has 
a somewhat curious rendering of ‘Oog’, because the accents appear in strange 
places within that font. In general, when you see ‘ [] ’ in one of these messages, 
it stands either for the paragraph indentation or for some sort of complex con- 
struction; in this particular case it stands for an umlaut that has been raised up 
to cover an ‘O’. 



► EXERCISE 6.3 

Can you explain the 1 1 ’ that appears after ‘lived’ in that message? 



► EXERCISE 6.4 

Why is there a space before the 1 1 ’ in ‘Drof-nats. 


I’? 


You don’t have to take out pencil and paper in order to write down the 
overfull box messages that you get before they disappear from view, since TgX 
always writes a “transcript” or “log file” that records what happened during each 
session. For example, you should now have a file called story.log containing 
the transcript of Experiment 3, as well as a file called texput.log containing 
the transcript of Experiment 1. (The transcript of Experiment 2 was probably 
overwritten when you did number 3.) Take a look at story.log now; you will 
see that the overfull box messages are accompanied not only by the abbreviated 
box contents, but also by some strange-looking data about hboxes and glue and 
kerns and such things. This data gives a precise description of what’s in that 
overfull box; TgX wizards will find such listings important, if they are called 
upon to diagnose some mysterious error, and you too may want to understand 
TgX’s internal code some day. 

The abbreviated forms of overfull boxes show the hyphenations that 
TgX tried before it resorted to overfilling. The hyphenation algorithm, which is 
described in Appendix H, is excellent but not perfect; for example, you can see 
from the messages in story.log that I^X finds the hyphen in ‘pre-ferred’, and 
it can even hyphenate ‘Drof-nats’. Yet it discovers no hyphen in ‘galaxy’, and 
every once in a while an overfull box problem can be cured simply by giving Tf^X 
a hint about how to hyphenate some word more completely. (We will see later 
that there are two ways to do this, either by inserting discretionary hyphens 
each time as in ‘gal\-axy’, or by saying ‘\hyphenation{gal-axy}’ once at the 
beginning of your manuscript.) 

In the present example, hyphenation is not a problem, since TgX found 
and tried all the hyphens that could possibly have helped. The only way to get 
rid of the overfull boxes is to change the tolerance, i.e., to allow wider spaces 
between words. Indeed, the tolerance that plain TgX uses for wide lines is 
completely inappropriate for 2-inch columns; such narrow columns simply can’t 
be achieved without loosening the constraints, unless you rewrite the copy to fit. 

T^X assigns a numerical value called “badness” to each line that it sets, 
in order to assess the quality of the spacing. The exact rules for badness are 


[] 

transcript 
log file 
hyphenation 
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badness 
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different for different fonts, and they will be discussed in Chapter 14; but here 
is the way badness works for the roman font of plain TgX: 

The badness of this line is 100. (very tight) 

The badness of this line is 12. (somewhat tight) 

The badness of this line is 0. (perfect) 

The badness of this line is 12. (somewhat loose) 

The badness of this line is 200. (loose) 

The badness of this line is 1000. (bad) 

The badness of this line is 5000. (awful) 

Plain TgX normally stipulates that no line’s badness should exceed 200; but in 
our case, the task would be impossible since 

‘tant galaxy called Ode;, there’ has badness 1521; 

‘he preferred to be called — was’ has badness 568. 

So we turn now to Experiment 4, in which spacing variations that are more 
appropriate to narrow columns will be used. 

Run TgX again, and begin this time by saying 

\hsize=2in \tolerance=1600 \input story 

so that lines with badness up to 1600 will be tolerated. Hurray! There are no 
overfull boxes this time. (But you do get a message about an underfull box, 
since TgX reports all boxes whose badness exceeds a certain threshold called 
\hbadness; plain TprjX sets \hbadness=1000.) Now make Tf^X work still harder 
by trying 

\hsize=1.5in \input story 

(thus leaving the tolerance at 1600 but making the column width still skimpier). 
Alas, overfull boxes return; so try typing 

\tolerance=10000 \input story 

in order to see what happens. TgX treats 10000 as if it were “infinite” tolerance, 
allowing arbitrarily wide space; thus, a tolerance of 10000 will never produce an 
overfull box, unless something strange occurs like an unhyphenatable word that 
is wider than the column itself. 

The underfull box that TgX produces in the 1.5-inch case is really bad; 
with such narrow limits, an occasional wide space is unavoidable. But try 

\raggedright \input story 

for a change. (This tells TgX not to worry about keeping the right margin 
straight, and to keep the spacing uniform within each line.) Finally, type 

\hsize=.75in \input story 

followed by ‘\end’, to complete Experiment 4. This makes the columns almost 
impossibly narrow. 
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f Tlie output from this experiment will give you some feeling for the problem 
of breaking a paragraph into approximately equal lines. When the lines are 
relatively wide, TpX will almost always find a good solution. But otherwise you will 
have to figure out some compromise, and several options are possible. Suppose you want 
to ensure that no lines have badness exceeding 500. Then you could set \tolerance to 
some high number, and \hbadness=500; TgX would not produce overfull boxes, but it 
would warn you about the underfull ones. Or you could set \tolerance=500; then TjgX 
might produce overfull boxes. If you really want to take corrective action, the second 
alternative is better, because you can look at an overfull box to see how much sticks 
out; it becomes graphically clear what remedies are possible. On the other hand, if you 
don’t have time to fix bad spacing — if you just want to know how bad it is — then the 
first alternative is better, although it may require more computer time. 

►EXERCISE 6.5 

JL When \raggedright has been specified, badness reflects the amount of space 
at the right margin, instead of the spacing between words. Devise an experiment by 
which you can easily determine what badness TjgX assigns to each line, when the story 
is set ragged-right in 1.5-inch columns. 

f A parameter called \hfuzz allows you to ignore boxes that are only slightly 
overfull. For example, if you say \hfuzz=lpt, a box must stick out more than 
one point before it is considered erroneous. Plain TjgX sets \hfuzz=0. lpt. 

(£)(£)*■ EXERCISE 6.6 

JL JL Inspection of the output from Experiment 4, especially page 3, shows that 
with narrow columns it would be better to allow white space to appear before and 
after a dash, whenever other spaces in the same line are being stretched. Define a 
\dash macro that does this. 


You were warned that this is a long chapter. But take heart: There’s 
only one more experiment to do, and then you will know enough about TgX to 
run it fearlessly by yourself forever after. The only thing you are still missing 
is some information about how to cope with error messages — i.e., not just with 
warnings about things like overfull boxes, but with cases where Tj\X actually 
stops and asks you what to do next. 

Error messages can be terrifying when you aren’t prepared for them; 
but they can be fun when you have the right attitude. Just remember that you 
really haven’t hurt the computer’s feelings, and that nobody will hold the errors 
against you. Then you’ll find that running TgX might actually be a creative 
experience instead of something to dread. 

The first step in Experiment 5 is to plant two intentional mistakes in 
the story.tex file. Change line 3 to 

\centerline{\bf A SHORT \ERR0R STORY} 

and change ‘\vskip’ to ‘\vship’ on line 2. 

Now run Tj^X again; but instead of ‘story’ type ‘sorry’. The computer 
should respond by saying that it can’t find file sorry.tex, and it will ask you 
to try again. Just hit (return) this time; you’ll see that you had better give the 
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name of a real file. So type ‘story’ and wait for TpX to find one of the faux pas ? 

in that file. ' inserting text online 

online interaction, see interaction 

Ah yes, the machine will soon stop,* after typing something like this: interacting with TeX 

! Undefined control sequence. 

1.2 \vship 

lin 

7 

TpX begins its error messages with ‘ ! ’, and it shows what it was reading at the 
time of the error by displaying two lines of context. The top line of the pair 
(in this case ‘\vship’) shows what TpX has looked at so far, and where it came 
from (‘1.2’, i.e., line number 2); the bottom line (in this case Tin’) shows what 
TpX has yet to read. 

The “?’ that appears after the context display means that TpX wants 
advice about what to do next. If you’ve never seen an error message before, or 
if you’ve forgotten what sort of response is expected, you can type “?’ now (go 
ahead and try it!); TpX will respond as follows: 

Type <return> to proceed, S to scroll future error messages, 

R to run without stopping, Q to run quietly, 

I to insert something, E to edit your file, 

1 or ... or 9 to ignore the next 1 to 9 tokens of input , 

H for help, X to quit. 

This is your menu of options. You may choose to continue in various ways: 

1. Simply type (return). TpX will resume its processing, after attempting 
to recover from the error as best it can. 

2. Type ‘S’. TpX will proceed without pausing for instructions if further 
errors arise. Subsequent error messages will flash by on your terminal, 
possibly faster than you can read them, and they will appear in your 
log file where you can scrutinize them at your leisure. Thus, ‘S’ is sort 
of like typing (return) to every message. 

3. Type ‘R’. This is like ‘S’ but even stronger, since it tells TpX not to stop 
for any reason, not even if a file name can’t be found. 

4. Type ‘ Q ’. This is like ‘R’ but even more so, since it tells TpX not only to 
proceed without stopping but also to suppress all further output to your 
terminal. It is a fast, but somewhat reckless, way to proceed (intended 
for running TpX with no operator in attendance). 

5. Type ‘I’, followed by some text that you want to insert. TpX will read 
this line of text before encountering what it would ordinarily see next. 

Lines inserted in this way are not assumed to end with a blank space. 


* Some installations of T^X do not allow interaction. In such cases all you can do is 
look at the error messages in your log file, where they will appear together with the 
“help” information. 
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6. Type a small number (less than 100). TeX will delete this many charac- 
ters and control sequences from whatever it is about to read next, and 
it will pause again to give you another chance to look things over. 

7. Type ‘H’. This is what you should do now and whenever you are faced 
with an error message that you haven’t seen for a while. TeX has two 
messages built in for each perceived error: a formal one and an informal 
one. The formal message is printed first (e.g., ?! Undefined control 
sequence.’); the informal one is printed if you request more help by 
typing ‘H’, and it also appears in your log file if you are scrolling error 
messages. The informal message tries to complement the formal one by 
explaining what TeX thinks the trouble is, and often by suggesting a 
strategy for recouping your losses. 

8. Type ‘X’. This stands for “exit.” It causes TgX to stop working on your 
job, after putting the finishing touches on your log file and on any pages 
that have already been output to your dvi file. The current (incomplete) 
page will not be output. 

9. Type ‘E’. This is like ‘X’, but it also prepares the computer to edit the 
file that TgX is currently reading, at the current position, so that you 
can conveniently make a change before trying again. 

After you type ‘H’ (or ‘h’, which also works), you’ll get a message that tries 
to explain that the control sequence just read by TgX (i.e., \vship) has never 
been assigned a meaning, and that you should either insert the correct control 
sequence or you should go on as if the offending one had not appeared. 

In this case, therefore, your best bet is to type 

I\vskip 


(and (return)), with no space after the ‘I’; this effectively replaces \vship by 
\vskip. (Do it.) 

If you had simply typed (return) instead of inserting anything, TeX 
would have gone ahead and read Tin’, which it would have regarded as part of 
a paragraph to be typeset. Alternatively, you could have typed ‘3’ ; that would 
have deleted Tin’ from TeX’s input. Or you could have typed ‘X’ or ‘E’ in order 
to correct the spelling error in your file. But it’s usually best to try to detect 
as many errors as you can, each time you run TeX, since that increases your 
productivity while decreasing your computer bills. Chapter 27 explains more 
about the art of steering TeX through troubled text. 



► EXERCISE 6.7 

Wliat would have happened if you had typed ‘5’ after the \vship error? 


f You can control the level of interaction by giving commands in your file as well 
as online: The Tj^X primitives \scrollmode, \nonstopmode, and \batchmode 
correspond respectively to typing ‘S’, ‘R’, or ‘Q’ in response to an error message, and 
\errorstopmode puts you back into the normal level of interaction. (Such changes are 
global, whether or not they appear inside a group.) Furthermore, many installations 
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have implemented a way to interrupt Tj?X while it is running; such an interruption 
causes the program to revert to \errorstopmode, after which it pauses and waits for 
further instructions. 

What happens next in Experiment 5? TpX will hiccup on the other 
bug that we planted in the file. This time, however, the error message is more 
elaborate, since the context appears on six lines instead of two: 

! Undefined control sequence. 

<argument> \bf A SHORT \ERR0R 

STORY 

\centerline #l->\line {\hss #1 

\hss } 

1.3 \centerline{\bf A SHORT \ERR0R STORY} 


interrupt 

argument 

centerline 


? 

You get multiline error messages like this when the error is detected while TpX is 
processing some higher-level commands — in this case, while it is trying to carry 
out \centerline, which is not a primitive operation (it is defined in plain TpX). 
At first, such error messages will appear to be complete nonsense to you, because 
much of what you see is low-level TpX code that you never wrote. But you can 
overcome this hangup by getting a feeling for the way TpX operates. 

First notice that the context information always appears in pairs of lines. 
As before, the top line shows what TpX has just read ( ‘\bf A SHORT \ERR0R’ ), 
then comes what it is about to read ( ‘STORY’). The next pair of lines shows the 
context of the first two; it indicates what TpX was doing just before it began to 
read the others. In this case, we see that TpX has just read ‘#1’, which is a special 
code that tells the machine to “read the first argument that is governed by the 
current control sequence” ; i.e., “now read the stuff that \centerline is supposed 
to center on a line.” The definition in Appendix B says that \centerline, when 
applied to some text, is supposed to be carried out by sticking that text in place 
of the ‘#1’ in ‘\line{\hss#l\hss}’. So TpX is in the midst of this expansion of 
\centerline, as well as being in the midst of the text that is to be centered. 

The bottom line shows how far TpX has gotten until now in the story 
file. (Actually the bottom line is blank in this example; what appears to be the 
bottom line is really the first of two lines of context, and it indicates that TpX 
has read everything including the ‘}’ in line 3 of the file.) Thus, the context in 
this error message gives us a glimpse of how TpX went about its business. First, 
it saw \centerline at the beginning of line 3. Then it looked at the definition 
of \centerline and noticed that \centerline takes an “argument,” i.e., that 
\centerline applies to the next character or control sequence or group that 
follows. So TpX read on, and filed ‘\bf A SHORT \ERR0R STORY’ away as the 
argument to \centerline. Then it began to read the expansion, as defined in 
Appendix B. When it reached the #1, it began to read the argument it had saved. 
And when it reached \ERR0R, it complained about an undefined control sequence. 
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►EXERCISE 6.8 

JL Why didn’t TpX complain about \EER0R being undefined when \ERR0R was 
first encountered, i.e., before reading ‘STORY}’ on line 3? 


When you get a multiline error message like this, the best clues about 
the source of the trouble are usually on the bottom line (since that is what 
you typed) and on the top line (since that is what triggered the error message). 
Somewhere in there you can usually spot the problem. 

Where should you go from here? If you type ‘H’ now, you’ll just get the 
same help message about undefined control sequences that you saw before. If you 
respond by typing (return), TgX will go on and finish the run, producing output 
virtually identical to that in Experiment 2. In other words, the conventional 
responses won’t teach you anything new. So type ‘E’ now; this terminates the 
run and prepares the way for you to fix the erroneous file. (On some systems, 
TgX will actually start up the standard text editor, and you’ll be positioned at 
the right place to delete ‘\ERR0R’. On other systems, T^X will simply tell you to 
edit line 3 of file story.tex.) 

When you edit story.tex again, you’ll notice that line 2 still contains 
\vship; the fact that you told TgX to insert \vskip doesn’t mean that your file 
has changed in any way. In general, you should correct all errors in the input 
file that were spotted by T^X during a run; the log file provides a handy way to 
remember what those errors were. 

Well, this has indeed been a long chapter, so let’s summarize what has 
been accomplished. By doing the five experiments you have learned at first 
hand (1) how to get a job printed via T^X; (2) how to make a file that contains 
a complete T^X manuscript; (3) how to change the plain TgX format to achieve 
columns with different widths; and (4) how to avoid panic when I);X issues 
stern warnings. 

So you could now stop reading this book and go on to print a bunch 
of documents. It is better, however, to continue bearing with the author (after 
perhaps taking another rest), since you’re just at the threshold of being able 
to do a lot more. And you ought to read Chapter 7 at least, because it warns 
you about certain symbols that you must not type unless you want to do 
something special. While reading the remaining chapters it will, of course, be 
best for you to continue making trial runs, using experiments of your own design. 

/gb/gb If you use TjjX format packages designed by others, your error messages 
JL JL may involve many inscrutable two-line levels of macro context. By setting 
\errorcontextlines=0 at the beginning of your file, you can reduce the amount of 
information that is reported; TjjX will show only the top and bottom pairs of context 
lines together with up to \errorcontextlines additional two-line items. (If anything 
has thereby been omitted, you’ll also see ‘...’.) Chances are good that you can 
spot the source of an error even when most of a large context has been suppressed; if 
not, you can say ‘I\errorcontextlines=100\oops’ and try again. (That will usually 
give you an undefined control sequence error and plenty of context.) Plain Tj?X sets 
\errorcontextlines=5. 


editing 

errorcontextlines 
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ARISTOTLE 

HABAKKUK 

COWPER 


What we have to learn to do we learn by doing. 
ARISTOTLE, Ethica Nicomachea II (c. 325 B.C.) 

He may run who reads. 
— HABAKKUK 2:2 (c. 600 B.C.) 

He that runs may read. 
— WILLIAM COWPER, Tirocinium (1785) 
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We observed in the previous chapter that an input manuscript is expressed in 
terms of “lines,” but that these lines of input are essentially independent of the 
lines of output that will appear on the finished pages. Thus you can stop typing 
a line of input at any place that’s convenient for you, as you prepare or edit a 
file. A few other related rules have also been mentioned: 


■ A (return) is like a space. 

■ Two spaces in a row count as one space. 

■ A blank line denotes the end of a paragraph. 

Strictly speaking, these rules are contradictory: A blank line is obtained by- 
typing (return) twice in a row, and this is different from typing two spaces in a 
row. Some day you might want to know the real rules. In this chapter and the 
next, we shall study the very first stage in the transition from input to output. 

In the first place, it’s wise to have a precise idea of what your keyboard 
sends to the machine. There are 256 characters that Tf.'X might encounter at 
each step, in a file or in a line of text typed directly on your terminal. These 
256 characters are classified into 16 categories numbered 0 to 15: 

Category Meaning 

0 Escape character (\ in this manual) 

1 Beginning of group ({ in this manual) 

2 End of group (} in this manual) 

3 Math shift ($ in this manual) 

4 Alignment tab (& in this manual) 

5 End of line ((return) in this manual) 

6 Parameter (# in this manual) 

7 Superscript (~ in this manual) 

8 Subscript (_ in this manual) 

9 Ignored character ((null) in this manual) 

10 Space (u in this manual) 

11 Letter (A, . . . , Z and a, . . . , z) 

12 Other character (none of the above or below) 

13 Active character (~ in this manual) 

14 Comment character (°/ 0 in this manual) 

15 Invalid character ((delete) in this manual) 

It’s not necessary for you to learn these code numbers; the point is only that 
T^X responds to 16 different types of characters. At first this manual led you to 
believe that there were just two types — the escape character and the others — 
and then you were told about two more types, the grouping symbols { and }. 
In Chapter 6 you learned two more: ~ and Now you know that there are 
really 16. This is the whole truth of the matter; no more types remain to be 
revealed. The category code for any character can be changed at any time, but 
it is usually wise to stick to a particular scheme. 


escape character 

begin-group character 

end-group character 

math mode character 
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parameter 
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subscript 

ignored character 

space 
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other character 
active character 
comment character 
invalid character 
category codes, table 
reserved character 
special character table 
null 
delete 
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The main thing to bear in mind is that each TpX format reserves certain 
characters for its own special purposes. For example, when you are using plain 
TpX format (Appendix B), you need to know that the ten characters 

\ { } $ & # ~ _ 7 ~ 


cannot be used in the ordinary way when you are typing; each of them will cause 
TpX to do something special, as explained elsewhere in this book. If you really 
need these symbols as part of your manuscript, plain TpX makes it possible for 
you to type 

\$ for $, \7 for %, \fo for &, \# for #, \_ for 


the \_ symbol is useful for compound-identifiers in computer programs. In math- 
ematics formulas you can use \{ and \} for { and }, while \backslash produces 
a reverse slash; for example, 

‘$\{a \backslash b\}$’ yields ‘{a\b}\ 

Furthermore \~ produces a circumflex accent (e.g., ‘\~e’ yields ‘e’ ); and \~ yields 
a tilde accent (e.g., ‘\~n’ yields ‘n’). 


► EXERCISE 7.1 

What horrible errors appear in the following sentence? 

Procter & Gamble’s stock climbed to $2, a 107 gain. 


► EXERCISE 7.2 

Can you imagine why the designer of plain TpX decided not to make ‘\V the 
control sequence for reverse slashes? 

f Wlien T^X reads a line of text from a file, or a line of text that you entered 
directly on your terminal, it converts that text into a list of “tokens.” A 
token is either (a) a single character with an attached category code, or (b) a control 
sequence. For example, if the normal conventions of plain are in force, the text 
‘{\hskip 36 pt}’ is converted into a list of eight tokens: 

Ii |hskip| 3i2 6 12 U10 Pll til 1"2 


The subscripts here are the category codes, as listed earlier: 1 for “beginning of group,” 
12 for “other character,” and so on. The |hskip| doesn’t get a subscript, because it 
represents a control sequence token instead of a character token. Notice that the space 
after \hskip does not get into the token list, because it follows a control word. 

f it is important to understand the idea of token lists, if you want to gain a 
thorough understanding of T^X, and it is convenient to learn the concept by 
thinking of TjgX as if it were a living organism. The individual lines of input in your 
files are seen only by T^gX’s “eyes” and “mouth” ; but after that text has been gobbled 
up, it is sent to TpX’s “stomach” in the form of a token list, and the digestive processes 
that do the actual typesetting are based entirely on tokens. As far as the stomach is 
concerned, the input flows in as a stream of tokens, somewhat as if your TpX manuscript 
had been typed all on one extremely long line. 
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backslash 
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f You should remember two chief things about TjjX’s tokens: (1) A control 
sequence is considered to be a single object that is no longer composed of a 
sequence of symbols. Therefore long control sequence names are no harder for Tj^X to 
deal with than short ones, after they have been replaced by tokens. Furthermore, spaces 
are not ignored after control sequences inside a token list; the ignore-space rule applies 
only in an input file, during the time that strings of characters are being tokenized. 
(2) Once a category code has been attached to a character token, the attachment is 
permanent. For example, if character were suddenly declared to be of category 12 
instead of category 1, the characters ‘{i’ already inside token lists of TjjX would still 
remain of category 1; only newly made lists would contain ‘{ 12 ’ tokens. In other words, 
individual characters receive a fixed interpretation as soon as they have been read from 
a file, based on the category they have at the time of reading. Control sequences 
are different, since they can change their interpretation at any time. TjrX’s digestive 
processes always know exactly what a character token signifies, because the category 
code appears in the token itself; but when the digestive processes encounter a control 
sequence token, they must look up the current definition of that control sequence in 
order to figure out what it means. 

(£)(£)*■ EXERCISE 7.3 

JL JL Some of the category codes 0 to 15 will never appear as subscripts in character 
tokens, because they disappear in TgX’s mouth. For example, characters of category 0 
(escapes) never get to be tokens. Which categories can actually reach TjrX’s stomach? 

&)(&) There’s a program called INITEX that is used to install TgX, starting from 
JL JL scratch; INITEX is like TjgX except that it can do even more things. It can 
compress hyphenation patterns into special tables that facilitate rapid hyphenation, and 
it can produce format files like ‘plain. fmt’ from ‘plain.tex’. But INITEX needs extra 
space to carry out such tasks, so it generally has less memory available for typesetting 
than you would expect to find in a production version of TjrX. 

When INITEX begins, it knows nothing but IjjX’s primitives. All 256 charac- 
JL JL ters are initially of category 12, except that (return) has category 5, (space) 
has category 10, (null) has category 9, (delete) has category 15, the 52 letters A . . . Z and 
a . . . z have category 11, / and \ have the respective categories 14 and 0. It follows that 
INITEX is initially incapable of carrying out some of TjjX’s primitives that depend on 
grouping; you can’t use \def or \hbox until there are characters of categories 1 and 2. 
The format in Appendix B begins with \catcode commands to provide characters of 
the necessary categories; e.g., 

\catcode ‘ \{=1 


assigns category 1 to the { symbol. The \catcode operation is like many other primi- 
tives of TjrX that we shall study later; by modifying internal quantities like the category 
codes, you can adapt TjjX to a wide variety of applications. 



► EXERCISE 7.4 

Suppose that the commands 


\catcode ‘ \<=1 \catcode ‘\>=2 
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INITEX 

hyphenation 

format 

return 

space 

null 

delete 

backslash 

percent 

catcode 


appear near the beginning of a group that begins with these specifications instruct 
TjrX to treat < and > as group delimiters. According to TjjX’s rules of locality, the 
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characters < and > will revert to tlieir previous categories when the group ends. But 
should the group end with } or with > ? 

/gL/gL Although control sequences are treated as single objects, does provide 
JL JL a way to break them into lists of character tokens: If you write \string\cs, 
where \cs is any control sequence, you get the list of characters for that control se- 
quence’s name. For example, \string\TeX produces four tokens: \i 2 , T 12 , ei 2 , X 12 . 
Each character in this token list automatically gets category code 12 (“other”), in- 
cluding the backslash that \string inserts to represent an escape character. However, 
category 10 will be assigned to the character ‘ u ’ (blank space) if a space character 
somehow sneaks into the name of a control sequence. 

/gL/gL Conversely, you can go from a list of character tokens to a control sequence by 
JL JL saying ‘\csname (tokens) \endcsname’. The tokens that appear in this construc- 
tion between \csname and \endcsname may include other control sequences, as long as 
those control sequences ultimately expand into characters instead of TpX primitives; the 
final characters can be of any category, not necessarily letters. For example, ‘\csname 
TeX\endcsname’ is essentially the same as ‘\TeX’; but ‘\csname\TeX\endcsname’ is il- 
legal, because \TeX expands into tokens containing the \kern primitive. Furthermore, 
‘\csname\string\TeX\endcsname’ will produce the unusual control sequence ‘\\TeX’, 
i.e., the token |\TeX| , which you can’t ordinarily write. 

(£)(£)*■ EXERCISE 7.5 

JL JL Experiment with Ij^X to see what \ string does when it is followed by an 
active character like (Active characters behave like control sequences, but they are 
not prefixed by an escape.) What is an easy way to conduct such experiments online? 
What control sequence could you put after \string to obtain the single character 
token \i 2 ? 

<§)(£)*■ EXERCISE 7.6 

JL JL What tokens does ‘\expandafter\string\csname a\string\ b\endcsname’ 
produce? (There are three spaces before the b. Chapter 20 explains \expandafter.) 

EXERCISE 7.7 

JL JL When \csname is used to define a control sequence for the first time, that 
control sequence is made equivalent to \relax until it is redefined. Use this fact to 
design a macro \ifundef ined#l such that, for example, 

\ifundef ined{TeX}(true text)\else(false text)\fi 

expands to the (true text) if \TeX hasn’t previously been defined, or if \TeX has been 
\let equal to \relax; it should expand to the (false text) otherwise. 

/gK/gK In the examples so far, \string has converted control sequences into lists of 
JL JL tokens that begin with \i 2 . But this backslash token isn’t really hardwired into 
TEX; there’s a parameter called \escapechar that specifies what character should be 
used when control sequences are output as text. The value of \escapechar is normally 
TeX’s internal code for backslash, but it can be changed if another convention is desired. 

/gL/gL TgX has two other token-producing operations similar to the \ string com- 
JL JL mand. If you write \number (number), you get the decimal equivalent of the 
(number); and if you write \romannumeral (number), you get the number expressed in 
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lowercase roman numerals. For example, ‘\romannumeral24’ produces ‘xxiv’, a list of 
four tokens each having category 12. The \number operation is redundant when it is 
applied to an explicit constant (e.g. , ‘\number24 1 produces ‘24’); but it does suppress 
leading zeros, and it can also be used with numbers that are in TjrX’s internal registers 
or parameters. For example, ‘\number-0015’ produces ‘ — 15’; and if register \count5 
holds the value 316, then ‘\number\count5’ produces ‘316’. 

/>)/>) The twin operations \uppercase{ (token list)} and \lowercase{ (token list)} 
JL JL go through a given token list and convert all of the character tokens to their 
“uppercase” or “lowercase” equivalents. Here’s how: Each of the 256 possible charac- 
ters has two associated values called the \uccode and the \lccode; these values are 
changeable just as a \catcode is. Conversion to uppercase means that a character 
is replaced by its \uccode value, unless the \uccode value is zero (when no change 
is made). Conversion to lowercase is similar, using the \lccode. The category codes 
aren’t changed. When INITEX begins, all \uccode and \lccode values are zero except 
that the letters a to z and A to Z have \uccode values A to Z and \lccode values a to z. 

/>)/>) TgX performs the \uppercase and \lowercase transformations in its stomach, 
JL JL but the \string and \number and \romannumeral and \csname operations are 
carried out en route to the stomach (like macro expansion), as explained in Chapter 20. 

(£)(£)*■ EXERCISE 7.8 

JL JL What token list results from ‘\uppercase{a\lowercase{bC}}’ ? 

(£)(£)*■ EXERCISE 7.9 

JL JL TpX has an internal integer parameter called \year that is set equal to the cur- 
rent year number at the beginning of every job. Explain how to use \year, together with 
\romannumeral and \uppercase, to print a copyright notice like ‘© MCMLXXXVF 
for all jobs run in 1986. 

EXERCISE 7.10 

JL JL Define a control sequence \appendroman with three parameters such that 
\appendroman#l#2#3 defines control sequence #1 to expand to a control sequence whose 
name is the name of control sequence #2 followed by the value of the positive integer 
#3 expressed in roman numerals. For example, suppose \count20 equals 30; then 
l \appendroman\a\TeX{\count20}’ should have the same effect as ‘\def \a{\TeXxxx}’. 
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tricky macros 
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Some bookes are to bee tasted, 
others to bee swallowed, 
and some few to bee chewed and disgested. 
— FRANCIS BACON, Essayes (1597) 

'T/s the good reader that makes the good book. 
— RALPH WALDO EMERSON, Society Solitude (1870) 
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A lot of different keyboards are used with TgX, but few keyboards can produce 
256 different symbols. Furthermore, as we have seen, some of the characters that 
you can type on your keyboard are reserved for special purposes like escaping 
and grouping. Yet when we studied fonts it was pointed out that there are 256 
characters per font. So how can you refer to the characters that aren’t on your 
keyboard, or that have been pre-empted for formatting? 

One answer is to use control sequences. For example, the plain format 
of Appendix B, which defines “/, to be a special kind of symbol so that you can 
use it for comments, defines the control sequence \7, to mean a percent sign. 

To get access to any character whatsoever, you can type 

\ char (number) 

where (number) is any number from 0 to 255 (optionally followed by a space); 
you will get the corresponding character from the current font. That’s how 
Appendix B handles Y/,; it defines ‘Y/.’ to be an abbreviation for ‘\char37’, since 
37 is the character code for a percent sign. 

The codes that Tj^X uses internally to represent characters are based on 
“ASCII,” the American Standard Code for Information Interchange. Appendix C 
gives full details of this code, which assigns numbers to certain control functions 
as well as to ordinary letters and punctuation marks. For example, (space) = 32 
and (return) = 13. There are 94 standard visible symbols, and they have been 
assigned code numbers from 33 to 126, inclusive. 

It turns out that ‘b’ is character number 98 in ASCII. So you can typeset 
the word bubble in a strange way by putting 

\char98 u\char98\char98 le 


into your manuscript, if the b-key on your typewriter is broken. (An optional 
space is ignored after constants like ‘98’. Of course you need the \, c, h, a, and r 
keys to type ‘\char’, so let’s hope that they are always working.) 

f TpX always uses the internal character code of Appendix C for the standard 
ASCII characters, regardless of what external coding scheme actually appears 
in the files being read. Thus, b is 98 inside of T^jX even when your computer normally 
deals with EBCDIC or some other non-ASCII scheme; the l^rX software has been set 
up to convert text files to internal code, and to convert back to the external code when 
writing text files. Device-independent (dvi) output files use TjXX’s internal code. In 
this way, TgX is able to give identical results on all computers. 


f Character code tables like those in Appendix C often give the code numbers in 
octal notation, i.e., the radix-8 number system, in which the digits are 0, 1 , 2, 
3, 4, 5, 6, and 7.* Sometimes hexadecimal notation is also used, in which case the digits 
are 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, C, D, E, and F. For example, the octal code for ‘b’ is 
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octal notation 
hexadecimal notation 


* The author of this manual likes to use italic digits for octal numbers, and type- 
writer type for hexadecimal numbers, in order to provide a typographic clue to the 
underlying radix whenever possible. 
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142, and its hexadecimal code is 62. A (number) in TgX’s language can begin with a ’ , 
in which case it is regarded as octal, or with a ", when it is regarded as hexadecimal. 
Thus, \char’142 and \char"62 are equivalent to \char98. The legitimate character 
codes in octal notation run from '0 to '577 ; in hexadecimal, they run from "0 to "FF. 

f But T)gX actually provides another kind of (number) that makes it unnecessary 
for you to know ASCII at all! The token ‘12 (left quote), when followed by 
any character token or by any control sequence token whose name is a single character, 
stands for TJrX’s internal code for the character in question. For example, \char ‘b and 
\char‘\b are also equivalent to \char98. If you look in Appendix B to see how \Z, is 
defined, you’ll notice that the definition is 

\def \’/,{\char ‘ Y/.} 


instead of \char37 as claimed above. 

/ ►EXERCISE 8.1 

JL What would be wrong with \def \'/,{\char “/,}? 

The preface to this manual points out that the author tells little white lies 
from time to time. Well, if you actually check Appendix B you’ll find that 

\chardef Y/,= ‘ Y/. 


is the true definition of Y/,. Since format designers often want to associate a spe- 
cial character with a special control sequence name, TfrX provides the construction 
‘\chardef (control sequence) = (number)’ for numbers between 0 and 255, as an efficient 
alternative to ‘\def (control sequence){\char(number)}’. 

Although you can use \char to access any character in the current font, 
you can’t use it in the middle of a control sequence. For example, if you type 

\\ char 98 


TgX reads this as the control sequence \\ followed by c, h, a, etc., not as the 
control sequence \b. 

You will hardly ever need to use \char when typing a manuscript, since 
the characters you want will probably be available as predefined control se- 
quences; \char is primarily intended for the designers of book formats like those 
in the appendices. But some day you may require a special symbol, and you 
may have to hunt through a font catalog until you find it. Once you find it, 
you can use it by simply selecting the appropriate font and then specifying the 
character number with \char. For example, the “dangerous bend” sign used in 
this manual appears as character number 127 of font manfnt, and that font is 
selected by the control sequence \manual. The macros in Appendix E therefore 
display dangerous bends by saying ‘{\manual\charl27}’. 

We have observed that the ASCII character set includes only 94 printable 
symbols; but T^X works internally with 256 different character codes, from 0 
to 255, each of which is assigned to one of the sixteen categories described in 
Chapter 7. If your keyboard has additional symbols, or if it doesn’t have the 
standard 94, the people who installed your local T 13 X system can tell you the 
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correspondence between what you type and the character number that TfrjX 
receives. Some people are fortunate enough to have keys marked and ‘<’ and 
it is possible to install TpX so that it will recognize these handy symbols 
and make the typing of mathematics more pleasant. But if you do not have such 
keys, you can get by with the control sequences \ne, \le, and \ge. 

f TpX has a standard way to refer to the invisible characters of ASCII: Code 0 
can be typed as the sequence of three characters ““<3, code 1 can be typed 
“A, and so on up to code 31, which is ““_ (see Appendix C). If the character following 
““ has an internal code between 64 and 127, TpX subtracts 64 from the code; if the 
code is between 0 and 63, TgX adds 64. Hence code 127 can be typed **?, and 
the dangerous bend sign can be obtained by saying {\manual““?}. However, you must 
change the category code of character 127 before using it, since this character ordinarily 
has category 15 (invalid); say, e.g., \catcode ‘\**?=12. The ““ notation is different from 
\char, because ““ combinations are like single characters; for example, it would not 
be permissible to say \catcode ‘ \charl27, but ““ symbols can even be used as letters 
within control words. 


One of the overfull box messages in Chapter 6 illustrates the fact that TjrX 
sometimes uses the funny ““ convention in its output: The umlaut character 
in that example appears as ““?, and the cedilla appears as “X, because 1 " ’ and 1 J ’ 
occur in positions 'll 7 and 'SO of the \tenrm font. 

There’s also a special convention in which “* is followed by two “lowercase 
hexadecimal digits,” 0-9 or a-f. With this convention, all 256 characters are 
obtainable in a uniform way, from ““00 to “*ff. Character 127 is **7f. 





Most of the ““ codes are unimportant except in unusual applications. But 
**M is particularly noteworthy because it is code 13, the ASCII (return) that 


T)gX normally places at the right end of every line of your input file. By changing the 


category of “ “M you can obtain useful special effects, as we shall see later. 



The control code ““I is also of potential interest, since it’s the ASCII (tab). 
Plain TgX makes (tab) act like a blank space. 


/>>/>) People who install T)gX systems for use with non-American alphabets can 
JL JL make TJ^X conform to any desired standard. For example, suppose you have a 
Norwegian keyboard containing the letter as, which conies in as code 241 (say). Your lo- 
cal format package should define \catcode‘as=ll; then you could have control sequences 
like \sasrtrykk. Your T)gX input files could be made readable by American installa- 
tions of T^rX that don’t have your keyboard, by substituting ~~fl for character 241. 
(For example, the stated control sequence would appear as \s““flrtrykk in the file; 
your American friends should also be provided with the format that you used, with 
its \catcode‘ f 1=11. ) Of course you should also arrange your fonts so that TgK’s 
character 241 will print as as; and you should change TgX’s hyphenation algorithm so 
that it will do correct Norwegian hyphenation. The main point is that such changes are 
not extremely difficult; nothing in the design of TJgX limits it to the American alphabet. 
Fine printing is obtained by fine tuning to the language or languages being used. 



European languages can also be accommodated effectively with only a limited 
character set. For example, let’s consider Norwegian again, but suppose that 


ne 

le 

ge 

not-equal 

less-or-equal 

greater-or-equal 

invalid 

double hat 

hat hat 

tenrm 

return 

hat hat M 

tab 

Norwegian keyboard 
Scandinavian letters 
foreign languages 
keyboards, non- ASCII 
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you want to use a keyboard without an ae character. You can arrange the font metric file 
so that TgX will interpret ae, o/, aa, AE, 0/, and AA as ligatures that produce ae, 0 , a, 
iE, 0, and A, respectively; and you could put the characters a and A into positions 128 
and 129 of the font. By setting \catcode ‘/=11 you would be able to use the ligature 
o/ in control sequences like ‘\ho/yre’. TpjX’s hyphenation method is not confused by 
ligatures; so you could use this scheme to operate essentially as suggested before, but 
with two keystrokes occasionally replacing one. (Your typists would have to watch 
out for the occasional times when the adjacent characters aa, oe, and o/ should not be 
treated as ligatures; also, ‘\/’ would be a control word, not a control symbol.) 

The rest of this chapter is devoted to IjgX’s reading rules, which define the 
J_ J_ conversion from text to tokens. For example, the fact that TgX ignores spaces 
after control words is a consequence of the rules below, which imply among other things 
that spaces after control words never become space tokens. The rules are intended to 
work the way you would expect them to, so you may not wish to bother reading them; 
but when you are communicating with a computer, it is nice to understand what the 
machine thinks it is doing, and here’s your chance. 


/>>/>) The input to TjgX is a sequence of “lines.” Whenever TjgX is reading a line of 
JL JL text from a file, or a line of text that you entered directly on your terminal, 
the computer’s reading apparatus is in one of three so-called states: 


State N Beginning a new line; 

State M Middle of a line; 

State S Skipping blanks. 

At the beginning of every line it’s in state N ; but most of the time it’s in state M, 
and after a control word or a space it’s in state S. Incidentally, “states” are different 
from the “modes” that we will be studying later; the current state refers to l^X’s 
eyes and mouth as they take in characters of new text, but the current mode refers 
to the condition of T^X’s gastro-intestinal tract. Most of the things that T^X does 
when it converts characters to tokens are independent of the current state, but there 
are differences when spaces or end-of-line characters are detected (categories 10 and 5). 

/gb/gb TjrjX deletes any (space) characters (number 32) that occur at the right end 
JL J_ of an input line. Then it inserts a (return) character (number 13) at the right 
end of the line, except that it places nothing additional at the end of a line that you 
inserted with ‘I’ during error recovery. Note that (return) is considered to be an actual 
character that is part of the line; you can obtain special effects by changing its catcode. 


&)(&) If Ij’-X sees an escape character (category 0) in any state, it scans the entire 
JL JL control sequence name as follows, (a) If there are no more characters in the 
line, the name is empty (like \csname\endcsname). Otherwise (b) if the next character 
is not of category 11 (letter), the name consists of that single symbol. Otherwise (c) the 
name consists of all letters beginning with the current one and ending just before the 
first nonletter, or at the end of the line. This name becomes a control sequence token. 
Tj?X goes into state S in case (c), or in case (b) with respect to a character of category 10 
(space); otherwise IjrX goes into state M. 



If Tj^X sees a superscript character (category 7) in any state, and if that charac- 
ter is followed by another identical character, and if those two equal characters 


font metric file 

control word 

control symbol 

lines 

states 

modes 

tokens 

space 

return 

error recovery 
control sequence 
null control sequence 
csname endcsname 
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are followed by a character of code c < 128, then they are deleted and 64 is added to 
or subtracted from the code c. (Thus, “A is replaced by a single character whose 
code is 1, etc., as explained earlier.) However, if the two superscript characters are im- 
mediately followed by two of the lowercase hexadecimal digits 0123456789abcdef , the 
four-character sequence is replaced by a single character having the specified hexadec- 
imal code. The replacement is carried out also if such a trio or quartet of characters is 
encountered during steps (b) or (c) of the control-sequence-name scanning procedure 
described above. After the replacement is made, Tf'X begins again as if the new char- 
acter had been present all the time. If a superscript character is not the first of such a 
trio or quartet, it is handled by the following rule. 

/gb/gb If TjgX gees a character of categories 1, 2, 3, 4, 6, 8, 11, 12, or 13, or a character 
JL JL of category 7 that is not the first of a special sequence as just described, it 
converts the character to a token by attaching the category code, and goes into state M. 
This is the normal case; almost every nonblank character is handled by this rule. 

/gb/gb If T^X sees an end-of-line character (category 5), it throws away any other 
JL JL information that might remain on the current line. Then if is in state N 
(new line), the end-of-line character is converted to the control sequence token ‘ |par| ’ 
(end of paragraph); if TpX is in state M (mid-line), the end-of-line character is con- 
verted to a token for character 32 (‘ u ’) of category 10 (space); and if Tf-'X is in state S 
(skipping blanks), the end-of-line character is simply dropped. 

/gb/gb If TjrX sees a character to be ignored (category 9), it simply bypasses that 
JL JL character as if it weren’t there, and remains in the same state. 

/gb/gb If TgX sees a character of category 10 (space), the action depends on the 
JL JL current state. If TjgX is in state N or S, the character is simply passed by, and 
IjjX remains in the same state. Otherwise TjjX is in state M ; the character is converted 
to a token of category 10 whose character code is 32, and TjjX enters state S. The 
character code in a space token is always 32. 

/gb/gb If Tj^X sees a comment character (category 14), it throws away that character 
JL JL and any other information that might remain on the current line. 

/gb/gb Finally, if TjjX sees an invalid character (category 15), it bypasses that char- 
JL JL acter, prints an error message, and remains in the same state. 

/gb/gb If TfeX has nothing more to read on the current line, it goes to the next line 
JL JL and enters state N. However, if \endinput has been specified for a file being 
\input, or if an \input file lias ended, TjjX returns to whatever it was reading when 
the \input command was originally given. (Further details of \input and \endinput 
are discussed in Chapter 20.) 

<§)(£)*■ EXERCISE 8.2 

JL JL Test your understanding of TjgX’s reading rules by answering the following 
quickie questions: (a) What is the difference between categories 5 and 14? (b) What is 
the difference between categories 3 and 4? (c.) What is the difference between categories 
11 and 12? (d) Are spaces ignored after active characters? (e) When a line ends with a 
comment character like '/„ are spaces ignored at the beginning of the next line? (f) Can 
an ignored character appear in the midst of a control sequence name? 


par 

space 

endinput 

input 



48 Chapter 8: The Characters You Type 


<&><&>*■ EXERCISE 8.3 

JL JL Look again at the error message that appears on page 31. When TjgX reported 
that \vship was an undefined control sequence, it printed two lines of context, showing 
that it was in the midst of reading line 2 of the story file. At the time of that error 
message, what state was TjrX in? What character was it about to read next? 

<&>&>*' EXERCISE 8.4 

JL JL Given the category codes of plain l^X format, what tokens are produced from 
the input line 1 $x“2$~ \TeX “~62"~6’? 

&)<£)*■ EXERCISE 8.5 

JL JL Consider an input file that contains exactly three lines; the first line says ‘Hi ! ’, 
while the other two lines are completely blank. What tokens are produced when TjtX 
reads this file, using the category codes of plain TjfX format? 

&)<£)*■ EXERCISE 8.6 

JL JL Assume that the category codes of plain TjrX are in force, except that the char- 
acters “"A, **B, **C, ~"M belong respectively to categories 0, 7, 10, and 11. What tokens 
are produced from the (rather ridiculous) input line 

(Remember that this line is followed by (return), which is “M; and recall that “0 
denotes the (null) character, which has category 9 when INITEX begins.) 

The special character inserted at the end of each line needn’t be (return); TjgX 
JL JL actually inserts the current value of an integer parameter called \endlinechar, 
which normally equals 13 but it can be changed like any other parameter. If the value 
of \endlinechar is negative or greater than 255, no character is appended, and the 
effect is as if every line ends with '/, (i.e., with a comment character). 

Since it is possible to change the category codes, TjrX might actually use 
JL JL several different categories for the same character on a single line. For example, 
Appendices D and E contain several ways to coerce TjgX to process text “verbatim,” 
so that the author could prepare this manual without great difficulty. (Try to imagine 
typesetting a T|-X manual; backslashes and other special characters need to switch back 
and forth between their normal categories and category 12!) Some care is needed to 
get the timing right, but you can make Tj?X behave in a variety of different ways by 
judiciously changing the categories. On the other hand, it is best not to play with the 
category codes very often, because you must remember that characters never change 
their categories once they have become tokens. For example, when the arguments to a 
macro are first scanned, they are placed into a token list, so their categories are fixed 
once and for all at that time. The author has intentionally kept the category codes 
numeric instead of mnemonic, in order to discourage people from making extensive use 
of \catcode changes except in unusual circumstances. 

&)<£)*■ EXERCISE 8.7 

JL JL Appendix B defines \lq and \rq to be abbreviations for ‘ and ’ (single left 
and right quotes, respectively). Explain why the definitions 

\chardef\lq=96 \chardef\rq=39 


null 

return 

endlinechar 

verbatim 

lq 

rq 


would not be as good. 
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Cummings 

ASCII 


for life's not a paragraph 

And death i think is no parenthesis. 
— e. e. cummings, since feeling is first (1926) 

This coded character set is to facilitate 
the general interchange of information 
among information processing systems, 
communication systems, and 
associated equipment. 
...An 8-bit set was considered 
but the need for more than 128 codes 
in general applications was not yet evident. 
ASA SUBCOMMITTEE X3.2, American Standard 
Code for Information Interchange (1963) 
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When you’re typing a manuscript for TgX, you need to know what symbols 
are available. The plain T^X format of Appendix B is based on the Computer 
Modern fonts, which provide the characters needed to typeset a wide variety 
of documents. It’s time now to discuss what a person can do with plain Tj^X 
when typing straight text. We’ve already touched on some of the slightly subtle 
things — for example, dashes and quotation marks were considered in Chapter 2, 
and certain kinds of accents appeared in the examples of Chapters 3 and 6. The 
purpose of this chapter is to give a more systematic summary of the possibilities, 
by putting all the facts together. 

Let’s begin with the rules for the normal roman font (\rm or \tenrm); 
plain Tf.;X will use this font for everything unless you specify otherwise. Most of 
the ordinary symbols that you need are readily available and you can type them 
in the ordinary way: There’s nothing special about 

the letters A to Z and a to z 

the digits 0 to 9 

common punctuation marks : ; !?() [] 1 @ 

except that TgX recognizes certain combinations as ligatures: 

ff yields ff; ff i yields ffi; ‘ ‘ yields “ ; ! ‘ yields j ; 

fi yields fi; ffl yields ffl; ’ ’ yields ” ; ?‘ yields l, 

fl yields fl; — yields-; yields — ; 

You can also type + and =, to get the corresponding symbols + and =; but it’s 
much better to use such characters only in math mode, i.e. , enclosed between 
two $ signs, since that tells TgX to insert the proper spacing for mathemat- 
ics. Math mode is explained later; for now, it’s just a good idea to remember 
that formulas and text should be segregated. A non-mathematical hyphen and 
a non-mathematical slash should be specified by typing and 7’ outside of 
mathematics mode, but subtraction and division should be specified by typing 
and 7’ between $ signs. 

The previous paragraph covers 80 of the 94 visible characters of standard 
ASCII; so your keyboard probably contains at least 14 more symbols, and you 
should learn to watch out for the remaining ones, since they are special. Four of 
these are preempted by plain TgX; if your manuscript requires the symbols 

$ # 7 . & 

you should remember to type them as 

\$ \# \V. \& 

respectively. Plain TgX also reserves the six symbols 

\ { } 

but you probably don’t mind losing these, since they don’t appear in normal 
copy. Braces and backslashes are available via control sequences in math mode. 


letters 

digits 

punctuation 

ligatures 

Spanish ligatures 

Colon 

Semicolon 

Exclamation point 

Shriek, see exclamation point 

Question mark 

Parentheses 

Brackets 

Apostrophe 

Reverse apostrophe 

Hamza, see apostrophe 

Ain, see reverse apostrophe 

Hyphen 

Dash 

Asterisk 

At sign 

Virgule, see slash 
Solidus, see slash 
Shilling sign, see slash 
Slash 
Period 

Full stop, see period 
Comma 
Plus sign 
Equals sign 
dollar sign 

sharp sign, see hash mark 

number sign, see hash mark 

hash mark 

percent sign 

ampersand 

backslash 

braces 

curly braces, see braces 

hat, see circumflex 

circumflex 

underline 

tilde 
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There are four remaining special characters in the standard ASCII set: 
" | < > 

Again, you don’t really want them when you’re typesetting text. (Double-quote 
marks should be replaced either by ‘ ‘ or by ’ ’ ; vertical lines and relation signs 
are needed only in math mode.) 

Scholarly publications in English often refer to other languages, so plain 
T^X makes it possible to typeset the most commonly used accents: 


Type 

to get 


Vo 

6 

(grave accent) 

Vo 

6 

(acute accent) 

\~o 

6 

(circumflex or “hat”) 

\"o 

6 

(umlaut or dieresis) 

\~o 

6 

(tilde or “squiggle”) 

\=o 

o 

(macron or “bar”) 

\.o 

6 

(dot accent) 

\u o 

6 

(breve accent) 

\v o 

6 

(hacek or “check”) 

\H o 

6 

(long Hungarian umlaut) 

\t oo 

oo 

(tie-after accent) 

Within the font, such accents are designed to appear at the right height for the 
letter ‘o’; but you can use them over any letter, and TgX will raise an accent that 
is supposed to be taller. Notice that spaces are needed in the last four cases, to 
separate the control sequences from the letters that follow. You could, however, 
type ‘\H{o}’ in order to avoid putting a space in the midst of a word. 

Plain TgX also provides three accents that go underneath: 

Type 

to get 


\c o 

9 

(cedilla accent) 

\d o 

o 

(dot-under accent) 

\b o 

o 

(bar-under accent) 

And there are a few special letters: 

Type 

to get 


\oe , \0E 

oe, 03 

(French ligature OE) 

\ae , \AE 

se, 3E 

(Latin and Scandinavian ligature AE) 

\aa, \AA 

a, A 

(Scandinavian A-with-circle) 

\o , \0 

0,0 

(Scandinavian O-with-slash) 

\1,\L 

1,L 

(Polish suppressed-L) 

\ss 

8 

(German “es-zet” or sharp S) 


The \rm font contains also the dotless letters ‘ 1 ’ and ‘j\ which you can obtain by 
typing ‘\i’ and ! \j’- These are needed because ‘i’ and ‘j’ should lose their dots 


double-quote mark 
vertical line, see norm 
norm symbol 
less than sign 
greater than sign 
accents 

grave accent 

acute accent 
esc hat 

circumflex accent 
hat accent 

umlaut accent 
dieresis 
esc tilde 
tilde accent 
squiggle accent 

macron accent 
bar accent 

dot accent 
v 

hacek accent 
check accent 
u 

breve accent 

H 

Hungarian umlaut 
t 

tie-after accent 

embellished letters, see accents 
c 

cedilla accent 
d 

dot-under accent 
emphatics, see dot-under 
b 

bar-under accent 

Scandinavian letters 

sharp S 

es-zet 

German 

Polish 

Norwegian 

Danish 

Swedish 

Icelandic 

suppressed-L 

diphthongs, see ae, oe 

dotless letters 

i 

j 
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when they gain an accent. For example, the right way to obtain ‘minus’ is to 
type ‘m\=\i n\u us’ or i m\={\i}n\u{u}s’. 

This completes our summary of the \rm font. Exactly the same conven- 
tions apply to \bf , \sl, and \it, so you don’t have to do things differently when 
you’re using a different typeface. For example, \bf \"o yields 6 and \it\& yields 
&. Isn’t that nice? 

f However, \tt is slightly different. You will be glad to know that ff, f i, and so 
on are not treated as ligatures when you’re using typewriter type; nor do you 
get ligatures from dashes and quote marks. That’s fine, because ordinary dashes and 
ordinary double-quotes are appropriate when you’re trying to imitate a typewriter. 
Most of the accents are available too. But \H, \., \1, and \L cannot be used — the 
typewriter font contains other symbols in their place. Indeed, you are suddenly allowed 
to type ", I , <, and >; see Appendix F. All of the letters, spaces, and other symbols in 
\tt have the same width. 

► EXERCISE 9.1 

What’s the non-naive way to type ‘naive’? 

► EXERCISE 9.2 

List some English words that contain accented letters. 

► EXERCISE 9.3 

How would you type ‘fEsop’s CEuvres en frangais’? 

► EXERCISE 9.4 

Explain what to type in order to get this sentence: Commentarii Academies 
scientiarum imperialis petropolitanee is now Akademiia Nauk SSSR, Doklady. 


► EXERCISE 9.5 

And how would you specify the names Ernesto Cesaro, Pal Erdos, 0ystein Ore, 
Stanislaw Swierczkowski, Sergei Iur’ev, Muhammad ibn Musa al-Khwarizmi? 



► EXERCISE 9.6 

Devise a way to typeset Pal Erdos in typewriter type. 


The following symbols come out looking exactly the same whether you 
are using \rm, \sl, \bf, \it, or \tt: 


Type to get 

\dag f (dagger or obelisk) 

\ddag i (double dagger or diesis) 

\S § (section number sign) 

\P (paragraph sign or pilcrow) 

(They appear in just one style because plain TgX gets them from the math 
symbols font. Lots of other symbols are needed for mathematics; we shall study 
them later. See Appendix B for a few more non-math symbols.) 


typewriter type 

doublequote 

vertical line 

less than sign 

greater than sign 

Cesaro 

Erdos 

Ore 

Swiercz... 

Iur’ev 

al-Khwarizmi 

dagger 

double dagger 
obelisk 

obelus, see obelisk 
diesis 

section number sign 

paragraph sign 

pilcrow, see paragraph sign 
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► EXERCISE 9.7 

In plain Tj^X’s italic font, the “$’ sign comes out as L £ This gives you a way 
to refer to pounds sterling, but you might want an italic dollar sign. Can you 
think of a way to typeset a reference to the book Europe on $15.00 a day ? 

Appendix B shows that plain TgX handles most, of the accents by using TJrX’s 
JL JL \accent primitive. For example, \ ’ #1 is equivalent to {\accentl9 #1}, where 
#1 is the argument being accented. The general rule is that \accent{number) puts an 
accent over the next character; the (number) tells where that accent appears in the 
current font. The accent is assumed to be properly positioned for a character whose 
height equals the x-heiglit of the current font; taller or shorter characters cause the 
accent to be raised or lowered, taking due account of the slantedness of the fonts of 
accenter and accentee. The width of the final construction is the width of the character 
being accented, regardless of the width of the accent. Mode-independent commands like 
font changes may appear between the accent number and the character to be accented, 
but grouping operations must not intervene. If it turns out that no suitable character 
is present, the accent will appear by itself as if you had said \char(number) instead of 
\accent(number). For example, \’{} produces '. 

(£)(£)*■ EXERCISE 9.8 

JL JL Why do you think plain TfrX defines \’#1 to be ‘{\accentl9 #1}’ instead of 
simply letting V be an abbreviation for ‘\accentl9 ’? (Why the extra braces, and 
why the argument #1?) 

It’s important to remember that these conventions we have discussed for ac- 
JL JL cents and special letters are not built into TgX itself; they belong only to the 
plain TgX format, which uses the Computer Modern fonts. Quite different conventions 
will be appropriate when other fonts are involved; format designers should provide rules 
for how to obtain accents and special characters in their particular systems. Plain TfrX 
works well enough when accents are infrequent, but the conventions of this chapter 
are by no means recommended for large-scale applications of TgX to other languages. 
For example, a well-designed TgX font for French might well treat accents as liga- 
tures, so that one could e’er ire de cette manie're nai"ve en franc/ais without 
backslashes. (See the remarks about Norwegian in Chapter 8.) 


dollar sign 

British pound sign 

pound sterling 

sterling 

accent 

x-height 

French 

foreign languages 
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SHAKESPEARE 

Munster 

ROOT 


Let's doo't after the high Roman fashion. 
— WILLIAM SHAKESPEARE, The Tragedie of Anthony and Cleopatra (1606) 

English is a straightforward, frank, honest, open-hearted, no-nonsense language, 
which has little truck with such devilish devious devices as accents; 
indeed U.S. editors and printers are often thrown into a dither 
when a foreign word insinuates itself into the language. 
However there is one word on which Americans seem to have closed ranks, 
printing it confidently, courageously, and almost invariably 
complete with accent — the cheese presented to us as Munster. 

Unfortunately, Munster doesn't take an accent. 
— WAVERLEY ROOT, in the International Herald Tribune (1982) 
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Sometimes you want to tell TgX how big to make a space, or how wide to make 
a line. For example, the short story of Chapter 6 used the instruction ‘\vskip 
.5cm’ to skip vertically by half a centimeter, and we also said ‘\hsize=4in’ to 
specify a horizontal size of 4 inches. It’s time now to consider the various ways 
such dimensions can be communicated to TpX. 

“Points” and “picas” are the traditional units of measure for printers 
and compositors in English-speaking countries, so T^X understands points and 
picas. T^X also understands inches and metric units, as well as the continental 
European versions of points and picas. Each unit of measure is given a two letter 
abbreviation, as follows: 

pt point (baselines in this manual are 12 pt apart) 

pc pica (lpc = 12 pt) 

in inch (lin = 72.27pt) 

bp big point (72 bp = lin) 

cm centimeter (2.54cm = lin) 

mm millimeter (10 mm = 1cm) 

dd didot point (1157 dd = 1238 pt) 

cc cicero (1 cc = 12 dd) 

sp scaled point (65536 sp = lpt) 

The output of TgX is firmly grounded in the metric system, using the conversion 
factors shown here as exact ratios. 

► EXERCISE 10.1 

How many points are there in 254 centimeters? 

When you want to express some physical dimension to TgX, type it as 

(optional sign) (number) (unit of measure) 
or 

(optional sign) (digit string) . (digit string) (unit of measure) 

where an (optional sign) is either a ‘+’ or a or nothing at all, and where a 
(digit string) consists of zero or more consecutive decimal digits. The ‘ . ’ can 
also be a V- For example, here are six typical dimensions: 

3 in 29 pc 

- . 013837in + 42,1 dd 

O.mm 123456789sp 

A plus sign is redundant, but some people occasionally like extra redundancy 
once in a while. Blank spaces are optional before the signs and the numbers and 
the units of measure, and you can also put an optional space after the dimension; 
but you should not put spaces within the digits of a number or between the letters 
of the unit of measure. 


dimensions 

Points 

picas 

units of measure, table 
pt 

point 

pc 

pica 

in 

inch 

b P 

big point 
cm 

centimeter 

mm 

millimeter 

dd 

didot point 
Didot, F. A. 
cc 

cicero 

sp 

scaled point 
optional sign 
digit string 


► EXERCISE 10.2 

Arrange those six “typical dimensions” into order, from smallest to largest. 
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Which 


► EXERCISE 10.3 

Two of the following three dimensions are legitimate according to T^X’s rules, 
two are they? What do they mean? Why is the other one incorrect? 


’ . 77pt 
"Ccc 


-,sp 


The following “rulers” have been typeset by TgX so that you can get 
some idea of how different units compare to each other. If no distortion has been 
introduced during the camera work and printing processes that have taken place 
after TpX did its work, these rulers are highly accurate. 


| 1 ' 1 I 1 ' 1 | 1 ' 1 I 1 ' 1 | 1 ' 1 I 1 ' 1 | 1 ' 1 I 1 ' 1 |4in 


1 300 pt 

1— n 300 dd 


10 cm 


/>, ►EXERCISE 10.4 

JL (To be worked after you know about boxes and glue and have read Chapter 21. ) 
Explain how to typeset such a 10 cm ruler, using TgX. 

f TjrX represents all dimensions internally as an integer multiple of the tiny 
units called sp. Since the wavelength of visible light is approximately 100 sp, 
rounding errors of a few sp make no difference to the eye. However, TfrX does all 
of its arithmetic very carefully so that identical results will be obtained on different 
computers. Different implementations of T^rX will produce the same line breaks and 
the same page breaks when presented with the same document, because the integer 
arithmetic will be the same. 

f The units have been defined here so that precise conversion to sp is efficient 
on a wide variety of machines. In order to achieve this, TJ/X’s “pt” has been 
made slightly larger than the official printer’s point, which was defined to equal exactly 
.013837 in by the American Typefounders Association in 1886 [cf. National Bureau of 
Standards Circular 570 (1956)]. In fact, one classical point is exactly .99999999 pt, so 
the “error” is essentially one part in 10 s . This is more than two orders of magnitude 
less than the amount by which the inch itself changed during 1959, when it shrank to 
2.54 cm from its former value of (1/0.3937) cm; so there is no point in worrying about 
the difference. The new definition 72.27 pt = 1 in is not only better for calculation, it is 
also easier to remember. 

f l);X will not deal with dimensions whose absolute value is 2 30 sp or more. In 
other words, the maximum legal dimension is slightly less than 16384 pt. This 
is a distance of about 18.892 feet (5.7583 meters), so it won’t cramp your style. 
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machine-independence 

rounding 

maximum legal dimension 



Chapter 10: Dimensions 59 


In a language manual like this it is convenient to use “angle brackets” 
in abbreviations for various constructions like (number) and (optional sign) and 
(digit string) . Henceforth we shall use the term (dimen) to stand for a legitimate 
TgX dimension. For example, 

\hsize=(dimen) 

will be the general way to define the column width that T^X is supposed to use. 
The idea is that (dimen) can be replaced by any quantity like ‘4in’ that satisfies 
Tf^X’s grammatical rules for dimensions; abbreviations in angle brackets make it 
easy to state such laws of grammar. 

When a dimension is zero, you have to specify a unit of measure even 
though the unit is irrelevant. Don’t just say ‘0’ ; say ‘Opt’ or ‘Oin’ or something. 

The 10-point size of type that you are now reading is normal in text- 
books, but you probably will often find yourself wanting a larger font. Plain TgX 
makes it easy to do this by providing magnified output. If you say 

\magnif ication=1200 

at the beginning of your manuscript, everything will be enlarged by 20%; i.e., it 
will come out at 1.2 times the normal size. Similarly, ‘\magnif ication=2000’ 
doubles everything; this actually quadruples the area of each letter, since heights 
and widths are both doubled. To magnify a document by the factor /, you say 
\magnif ication=(number), where the (number) is 1000 times /. This instruc- 
tion must be given before the first page of output has been completed. You 
cannot apply two different magnifications to the same document. 

Magnification has obvious advantages: You’ll have less eyestrain when 
you’re proofreading; you can easily make transparencies for lectures; and you 
can photo-reduce magnified output, in order to minimize the deficiencies of a 
low-resolution printer. Conversely, you might even want ‘\magnif ication=500’ 
in order to create a pocket-size version of some book. But there’s a slight catch: 
You can’t use magnification unless your printing device happens to have the 
fonts that you need at the magnification you desire. In other words, you need 
to find out what sizes are available before you can magnify. Most installations 
of TgX make it possible to print all the fonts of plain TgX if you magnify by 
\magstepO, 1, 2, 3, and perhaps 4 or even 5 (see Chapter 4); but the use of large 
fonts can be expensive because a lot of system memory space is often required 
to store the shapes. 


► EXERCISE 10.5 

Try printing the short story of Chapter 6 at 1.2, 1.44, and 1.728 times the normal 
size. What should you type to get Tp’X to do this? 



When you say \magnif ication=2000, an operation like ‘\vskip.5cm’ will ac- 
tually skip 1.0 cm of space in the final document. If you want to specify a 


dimension in terms of the final size, Tj^X allows you to say ‘true’ just before pt, pc, in, 
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bp, cm, mm, dd, cc, and sp. This unmagnifies the units, so that the subsequent magni- 
fication will cancel out. For example, ‘\vskip . 5truecm’ is equivalent to ‘\vskip . 25cm’ 
if you have previously said ‘\magnif ication=2000’. Plain Tj?X uses this feature in the 
\magnif ication command itself: Appendix B includes the instruction 

\hsize = 6.5 true in 


just after a new magnification has taken effect. This adjusts the line width so that the 
material on each page will be 6| inches wide when it is finally printed, regardless of the 
magnification factor. There will be an inch of margin at both left and right, assuming 
that the paper is 8| inches wide. 

f lf you use no ‘true’ dimensions, TjrX’s internal computations are not affected 
by the presence or absence of magnification; line breaks and page breaks will 
be the same, and the dvi file will change in only two places. TjjX simply tells the 
printing routine that you want a certain magnification, and the printing routine will 
do the actual enlargement when it reads the dvi file. 


► EXERCISE 10.6 

Chapter 4 mentions that fonts of different magnifications can be used in the 
same job, by loading them ‘at’ different sizes. Explain what fonts will be used when 
you give the commands 



\magnif ication=\magstepl 
\font\f irst=cmrlO scaled\magstepl 
\font\second=cmrlO at 12truept 

/>)/>) Magnification is actually governed by TjrX’s \mag primitive, which is an integer 
JL JL parameter that should be positive and at most 32768. The value of \mag is 
examined in three cases: (1) just before the first page is shipped to the dvi file; (2) when 
computing a true dimension; (3) when the dvi file is being closed. Alternatively, some 
implementations of TjrX produce non-dvi output; they examine \mag in case (2) and 
also when shipping out each page. Since each document has only one magnification, 
the value of \mag must not change after it has first been examined. 



Tp'X also recognizes two units of measure that are relative rather than absolute; 
i.e. , they depend on the current context: 


em is the width of a “quad” in the current font; 
ex is the “x-height” of the current font. 

Each font defines its own em and ex values. In olden days, an “em” was the width 
of an ‘M’, but this is no longer true; ems are simply arbitrary units that come with a 
font, and so are exes. The Computer Modern fonts have the property that an em-dasli 
is one em wide, each of the digits 0 to 9 is half an em wide, and lowercase ‘x’ is one ex 
high; but these are not hard-and-fast rules for all fonts. The \rm font (cmrlO) of plain 
IjjX has lem = 10 pt and lex « 4.3 pt; the \bf font (cmbxlO) has lem = 11.5 pt and 
lex « 4.44 pt; and the \tt font (cmttlO) has lem = 10.5 pt and 1 ex 4.3 pt. All of 
these are “10-point” fonts, yet they have different em and ex values. It is generally best 
to use em for horizontal measurements and ex for vertical measurements that depend 
on the current font. 
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f A {dimen} can also refer to T^X’s internal registers or parameters. We shall 
discuss registers later, and a complete definition of everything that a {dimen} 
can be will be given in Chapter 24. For now it will suffice to give some hints about 
what is to come: ‘\hsize’ stands for the current horizontal line size, and ‘.5\hsize’ 
is half that amount; ‘2\wd3’ denotes twice the width of register \box3; ‘-\dimenlOO’ is 
the negative of register \dimenl00. 

($)&) Notice that the unit names in dimensions are not preceded by backslashes. The 
JL JL same is true of other so-called keywords of the T)gX language. Keywords can be 
given in uppercase letters or in a mixture of upper and lower case; e.g., ‘Pt’ is equivalent 
to ‘pt’. The category codes of these letters are irrelevant; you may, for example, be 
using a p of category 12 (other) that was generated by expanding ‘\the\hsize’ as 
explained in Chapter 20. T^X gives a special interpretation to keywords only when 
they appear in certain very restricted contexts. For example, ‘pt’ is a keyword only 
when it appears after a number in a {dimen}; ‘at’ is a keyword only when it appears 
after the external name of a font in a \font declaration. Here is a complete list of 
TgX’s keywords, in case you are wondering about the full set: at, bp, by, cc, cm, dd, 
depth, em, ex, fil, height, in, 1, minus, mm, mu, pc, plus, pt, scaled, sp, spread, to, 
true, width. (See Appendix I for references to the contexts in which each of these is 
recognized as a keyword.) 


dimen 
keywords 
reserved words 
RAPES. 

SESAME STREET 
Children’s Television Workshop 


The methods that have hitherto been taken 
to discover the measure of the Roman foot, 
will, upon examination, be found so unsatisfactory, that 
it is no wonder the learned are not yet agreed on that point. 

9 London inches are equal to 8,447 Paris inches. 
— MATTHEW RAPER, in Philosophical Transactions (1760) 

Without the letter U, 
units would be nits. 
— SESAME STREET (1970) 
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TgX makes complicated pages by starting with simple individual characters and 
putting them together in larger units, and putting these together in still larger 
units, and so on. Conceptually, it’s a big paste-up job. The T^Xmcal terms used 
to describe such page construction are boxes and glue. 

Boxes in TgX are two-dimensional things with a rectangular shape, hav- 
ing three associated measurements called height , width , and depth. Here is a 
picture of a typical box, showing its so-called reference point and baseline: 


Reference point — s- 



From TgX’s viewpoint, a single character from a font is a box; it’s one of the 
simplest kinds of boxes. The font designer has decided what the height, width, 
and depth of the character are, and what the symbol will look like when it is in 
the box; TgX uses these dimensions to paste boxes together, and ultimately to 
determine the locations of the reference points for all characters on a page. In 
plain Tf^X’s \rm font (cmrlO), for example, the letter ‘h’ has a height of 6.9444 
points, a width of 5.5555 points, and a depth of zero; the letter ‘g’ has a height 
of 4.3055 points, a width of 5 points, and a depth of 1.9444 points. Only certain 
special characters like parentheses have height plus depth actually equal to 10 
points, although cmrlO is said to be a “10-point” font. You needn’t bother to 
learn these measurements yourself, but it’s good to be aware of the fact that T^X 
deals with such information; then you can better understand what the computer 
does to your manuscript. 

The character shape need not fit inside the boundaries of its box. For 
example, some characters that are used to build up larger math symbols like 
matrix brackets intentionally protrude a little bit, so that they overlap properly 
with the rest of the symbol. Slanted letters frequently extend a little to the right 
of the box, as if the box were skewed right at the top and left at the bottom, 
keeping its baseline fixed. For example, compare the letter ‘g’ in the cmrlO and 
cmsllO fonts (\rm and \sl): 


boxes 

glue 

height 

width 

depth 

reference point 
baseline 
cmrlO 
cmsllO 


(A figure will be inserted here; too bad you can’t see it now. 
It shows two g’s, as claimed.) 


In both cases TgX thinks that the box is 5 points wide, so both letters get exactly 
the same treatment. TgX doesn’t have any idea where the ink will go — only the 
output device knows this. But the slanted letters will be spaced properly in spite 
of TgX’s lack of knowledge, because the baselines will match up. 
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Actually the font designer also tells TpX one other thing, the so-called 
italic correction : A number is specified for each character, telling roughly how 
far that character extends to the right of its box boundary, plus a little to spare. 
For example, the italic correction for ‘g’ in cmrlO is 0.1389 pt, while in cmsllO 
it is 0.8565 pt. Chapter 4 points out that this correction is added to the normal 
width if you type ! \/’ just after the character. You should remember to use \/ 
when shifting from a slanted font to an unslanted one, especially in cases like 

the so-called {\sl italic correction\/} : 

since no space intervenes here to compensate for the loss of slant. 

TpX also deals with another simple kind of box, which might be called 
a “black box,” namely, a rectangle like ‘|’ that is to be entirely filled with ink 
at printing time. You can specify any height, width, and depth you like for such 
boxes — but they had better not have too much area, or the printer might get 
upset. (Printers generally prefer white space to black space.) 

Usually these black boxes are made very skinny, so that they appear as 
horizontal lines or vertical lines. Printers traditionally call such lines “horizontal 
rules” and “vertical rules,” so the terms TpX uses to stand for black boxes are 
\hrule and \vrule. Even when the box is square, as in ' ■ you must call it 
either an \hrule or a \vrule. We shall discuss the use of rule boxes in greater 
detail later. (See Chapter 21.) 

Everything on a page that has been typeset by TpX is made up of simple 
character boxes or rule boxes, pasted together in combination. TpX pastes boxes 
together in two ways, either horizontally or vertically. When TpX builds a 
horizontal list of boxes, it lines them up so that their reference points appear 
in the same horizontal row; therefore the baselines of adjacent characters will 
match up as they should. Similarly, when TeX builds a vertical list of boxes, it 
lines them up so that their reference points appear in the same vertical column. 

Let’s take a look at what TpX does behind the scenes, by comparing 
the computer’s methods with what you would do if you were setting metal type 
by hand. In the time-tested traditional method, you choose the letters that 
you need out of a type case — the uppercase letters are in the upper case — and 
you put them into a “composing stick.” When a line is complete, you adjust 
the spacing and transfer the result to the “chase,” where it joins the other rows 
of type. Eventually you lock the type up tightly by adjusting external wedges 
called “quoins.” This isn’t much different from what TpX does, except that 
different words are used; when TpX locks up a line, it creates what is called an 
“hbox” (horizontal box), because the components of the line are pieced together 
horizontally. You can give an instruction like 

\hbox{A line of type.} 

in a TpX manuscript; this tells the computer to take boxes for the appropriate 
letters in the current font and to lock them up in an hbox. As far as TpX is 
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concerned, the letter ‘A’ is a box □ ’ and the letter ‘p’ is a box So the 
given instruction causes TgX to form the hbox 


representing ‘A line of type.’ The hboxes for individual lines of type are eventu- 
ally joined together by putting them into a “vbox” (vertical box). For example, 
you can say 

\vbox{\hbox{Two lines}\hbox{of type.}} 
and TgX will convert this into 

Two lines 
of type. 

The principal difference between Tp]X’s method and the old way is that metal 
types are generally cast so that each character has the same height and depth; 
this makes it easy to line them up by hand. TgX’s types have variable height 
and depth, because the computer has no trouble lining characters up by their 
baselines, and because the extra information about height and depth helps in 
the positioning of accents and mathematical symbols. 

Another important difference between TgX setting and hand setting is, of 
course, that TgX will choose line divisions automatically; you don’t have to insert 
\hbox and \vbox instructions unless you want to retain complete control over 
where each letter goes. On the other hand, if you do use \hbox and \vbox, you 
can make TpX do almost everything that Ben Franklin could do in his printer’s 
shop. You’re only giving up the ability to make the letters come out charmingly 
crooked or badly inked; for such effects you need to make a new font. (And 
of course you lose the tactile and olfactory sensations, and the thrill of doing 
everything by yourself. TgX will never completely replace the good old ways.) 

A page of text like the one you’re reading is itself a box, in TgX’s view: 
It is a largish box made from a vertical list of smaller boxes representing the lines 
of text. Each line of text, in turn, is a box made from a horizontal list of boxes 
representing the individual characters. In more complicated situations, involving 
mathematical formulas and/or complex tables, you can have boxes within boxes 
within boxes ... to any level. But even these complicated situations arise from 
horizontal or vertical lists of boxes pasted together in a simple way; all that you 
and TgX have to worry about is one list of boxes at a time. In fact, when you’re 
typing straight text, you don’t have to think about boxes at all, since T^X will 
automatically take responsibility for assembling the character boxes into words 
and the words into lines and the lines into pages. You need to be aware of the 
box concept only when you want to do something out of the ordinary, e.g., when 
you want to center a heading. 




From the standpoint of TgX’s digestive processes, a manuscript comes in as a 
sequence of tokens, and the tokens are to be transformed into a sequence of 


boxes. Each token of input is essentially an instruction or a piece of an instruction; for 
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example, the token ‘An’ normally means, “put a character box for the letter A at the 
end of the current hbox, using the current font” ; the token 1 |vskip| ’ normally means, 
“skip vertically in the current vbox by the (dirnen) specified in the following tokens.” 

f The height, width, or depth of a box might be negative, in which case it is a 
“shadow box” that is somewhat hard to draw. T]-'X doesn’t balk at negative 
dimensions; it just does arithmetic as usual. For example, the combined width of two 
adjacent boxes is the sum of their widths, whether or not the widths are positive. A 
font designer can declare a character’s width to be negative, in which case the character 
acts like a backspace. (Languages that read from right to left could be handled in this 
way, but only to a limited extent, since TjrX’s line-breaking algorithm is based on the 
assumption that words don’t have negative widths.) 

can raise or lower the individual boxes in a horizontal list; such adjust- 
ments take care of mathematical subscripts and superscripts, as well as the 
heights of accents and a few other things. For example, here is a way to make a box 
that contains the TjrX logo, putting it into TjjX’s internal register \boxO: 

\setboxO=\hbox{T\kern- . 1667em\lower . 5ex\hbox{E}\kern- . 125em X} 


Here ‘\kern- . 1667em’ means to insert blank space of —.1667 ems in the current font, 
i.e., to back up a bit; and ‘\lower.5ex’ means that the box \hbox{E} is to be lowered 
by half of the current x-heiglit, thus offsetting that box with respect to the others. 
Instead of ‘\lower.5ex’ one could also say Araise- . 5ex’. Chapters 12 and 21 discuss 
the details of how to construct boxes for special effects; our goal in the present chapter 
is merely to get a taste of the possibilities. 



file will 


TpX will exhibit the contents of any box register, if you ask it to. For example, 
if you type AshowboxO’ after setting \boxO to the TgX logo as above, your log 
contain the following niumbo jumbo: 


\hbox (6 . 83331+2 . 15277)xl8 .6108 
. \tenrm T 
Akern -1.66702 

Ahbox (6. 83331+0. 0)x6. 80557, shifted 2.15277 
. . \tenrm E 
Akern -1.25 
. \tenrm X 


The first line means that \boxO is an hbox whose height, depth, and width are re- 
spectively 6.83331 pt, 2.15277 pt, and 18.6108 pt. Subsequent lines beginning with 
indicate that they are Inside of a box. The first thing in this particular box is the 
letter T in font \tenrm; then comes a kern. The next item is an hbox that contains 
only the letter E; this box has the height, depth, and width of an E, and it has been 
shifted downward by 2.15277 pt (thereby accounting for the depth of the larger box). 



► EXERCISE 11.1 

Why are there two dots in the 1 . . \tenrm E’ line here? 


f Such displays of box contents will be discussed further in Chapters 12 and 17. 

They are used primarily for diagnostic purposes, when you are trying to figure 
out exactly what TjjX thinks it’s doing. The main reason for bringing them up in the 
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present chapter is simply to provide a glimpse of how TgX represents boxes in its 
guts. A computer program doesn’t really move boxes around; it fiddles with lists of 
representations of boxes. 


►EXERCISE 11.2 

JL By running TJ^X, figure out how it actually handles italic corrections to char- 
acters: How are the corrections represented inside a box? 



► EXERCISE 11.3 

The “opposite” of T)?X’s logo — namely. T^X — is produced by 


\setboxl=\hbox{T\kern+. 1667em\raise . 5ex\hbox{E}\kern+ . 125em X} 


What would \showboxl show now? (Try to guess, without running the machine.) 

► EXERCISE 11.4 

Why do you think the author of TfrX didn’t make boxes more symmetrical 
between horizontal and vertical, by allowing reference points to be inside the boundary 
instead of insisting that the reference point must appear at the left edge of each box? 



^►EXERCISE 11.5 

Construct a \demobox macro for use in writing manuals like this, so that an 
author can write ‘\demobox{Tough exercise.}’ in order to typeset ‘ P^rtl *■■] ’. 



EXERCISE 11.6 

Construct a \frac macro such that ‘\fracl/2’ yields 1 


1 h'- 


WALTON 

STEVENSON 

OSBOURNE 


I have several boxes in my memory 
in which I will keep them all very safe, 
there shall not a one of them be lost. 
— IZAAK WALTON, The Compleat Angler (1653) 

How very little does the amateur, dwelling at home at ease, 
comprehend the labours and perils of the author. 
— R. L. STEVENSON and L. OSBOURNE, The Wrong Box (1889) 
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But there’s more to the story than just boxes: There’s also some magic mortar 
called glue that Tj^X uses to paste boxes together. For example, there is a little 
space between the lines of text in this manual; it has been calculated so that 
the baselines of consecutive lines within a paragraph are exactly 12 points apart. 
And there is space between words too; such space is not an “empty” box, it 
is part of the glue between boxes. This glue can stretch or shrink so that the 
right-hand margin of each page comes out looking straight. 

When TgX makes a large box from a horizontal or vertical list of smaller 
boxes, there often is glue between the smaller boxes. Glue has three attributes, 
namely its natural space, its ability to stretch, and its ability to shrink. 

In order to understand how this works, consider the following example 
of four boxes in a horizontal list separated by three globs of glue: 



shrink 1 shrink 2 shrink 0 

< width 52 y 


glue 

leading, see baselineskip 
skipping space, see glue 
stretch 
shrink 

natural width 


The first glue element has 9 units of space, 3 of stretch, and 1 of shrink; the next 
one also has 9 units of space, but 6 units of stretch and 2 of shrink; the last one 
has 12 units of space, but it is unable to stretch or to shrink, so it will remain 
12 units of space no matter what. 

The total width of boxes and glue in this example, considering only the 
space components of the glue, is 5 + 9 + 6 + 9 + 3 + 12 + 8 = 52 units. This 
is called the natural width of the horizontal list; it’s the preferred way to paste 
the boxes together. Suppose, however, that TgX is told to make the horizontal 
list into a box that is 58 units wide; then the glue has to stretch by 6 units. 
Well, there are 3 + 6 + 0 = 9 units of stretchability present, so TgX multiplies 
each unit of stretchability by 6/9 in order to obtain the extra 6 units needed. 
The first glob of glue becomes 9 + (6/9) x 3 = 11 units wide, the next becomes 
9 + (6/9) x 6 = 13 units wide, the last remains 12 units wide, and we obtain the 
desired box looking like this: 



t width 58 >- 


On the other hand, if Tj^X is supposed to make a box 51 units wide from 
the given list, it is necessary for the glue to shrink by a total of one unit. There 
are three units of shrinkability present, so the first glob of glue would shrink by 
1/3 and the second by 2/3. 
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The process of determining glue thickness when a box is being made 
from a horizontal or vertical list is called setting the glue. Once glue has been 
set, it becomes rigid; it won’t stretch or shrink any more, and the resulting box 
is essentially indecomposable. 

Glue will never shrink more than its stated shrinkability. For example, 
the first glob of glue in our illustration will never be allowed to become narrower 
than 8 units wide, and TpX will never shrink the given horizontal list to make 
its total width less than 49 units. But glue is allowed to stretch arbitrarily far, 
whenever it has a positive stretch component. 

► EXERCISE 12.1 

How wide would the glue globs be if the horizontal list in the illustration were 
to be made 100 units wide? 


Once you understand TgX’s concept of glue, you may well decide that 
it was misnamed; real glue doesn’t stretch or shrink in such ways, nor does it 
contribute much space between boxes that it welds together. Another word like 
“spring” would be much closer to the essential idea, since springs have a nat- 
ural width, and since different springs compress and expand at different rates 
under tension. But whenever the author has suggested changing TgX’s termi- 
nology, numerous people have said that they like the word “glue” in spite of its 
inappropriateness; so the original name has stuck. 

f TgX is somewhat reluctant to stretch glue more than the stated stretchability; 

therefore you can decide how big to make each aspect of the glue by using the 
following rules: (a) The natural glue space should be the amount of space that looks 
best. (b) The glue stretch should be the maximum amount of space that can be 
added to the natural spacing before the layout begins to look bad. (c) The glue shrink 
should be the maximum amount of space that can be subtracted from the natural 
spacing before the layout begins to look bad. 


In most cases the designer of a book layout will have specified all the 
kinds of glue that are to be used, so a typist will not need to decide how big 
any glue attributes should be. For example, users of the plain Tf^X format of 
Appendix B can type ‘\smallskip’ when they want a little extra space between 
paragraphs; a \smallskip turns out to be 3pt worth of vertical glue that can 
stretch or shrink by an additional lpt. Here is a \smallskip: 

Instead of sprinkling various amounts of glue throughout a manuscript, express- 
ing each of them explicitly in terms of points, you will find it much better 
to explain your intentions more clearly by typing something like ‘\smallskip’ 
when you want abnormal spacing. The definition of \smallskip can readily 
be changed later, in case you want such spaces to be smaller or larger. Plain 
TgX also provides you with ‘\medskip’, which is worth two smallskips, and 
‘\bigskip’, which is worth two medskips. 



A plain T^rX \medskip appears before and after each “dangerous bend” section 
of this manual, so you have already seen numerous examples of such spacing 
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before you knew what it was called. Vertical glue is created by writing ‘\vskip{glue)\ 
where (glue) is any glue specification. The usual way to specify (glue) to TjgX is 

{dimen) plus(dimen) minus (dimen) 

where the ‘plus(dimen) 1 and ‘minus (dimen)’ are optional and assumed to be zero if not 
present; ‘plus’ introduces the amount of stretcliability, ‘minus’ introduces the amount 
of shrinkability. For example, Appendix B defines \medskip to be an abbreviation for 
‘\vskip6pt plus2pt minus2pt’. The normal-space component of glue must always be 
given as an explicit (dimen), even when it is zero. 

f Horizontal glue is created in the same way, but with \hskip instead of \vskip. 

For example, plain Tj?X defines \enskip as an abbreviation for the command 
‘\hskip. 5em\relax’; this skips horizontally by one “en,” i.e., by exactly half of an em 
in the current font. There is no stretching or shrinking in an \enskip. The control 
sequence \relax after ‘ . 5em’ prevents IjgX from thinking that a keyword is present, in 
case the text following \enskip just happens to begin with ‘plus’ or ‘minus’. 

One of the interesting things that happens when glue stretches and 
shrinks at different rates is that there might be glue with infinite stretchabil- 
itv. For example, consider again the four boxes we had at the beginning of this 
chapter, with the same glue as before except that the glue in the middle can 
stretch infinitely far. Now the total stretcliability is infinite; and when the line 
has to grow, all of the additional space is put into the middle glue. If, for ex- 
ample, a box of width 58 is desired, the middle glue expands from 9 to 15 units, 
and the other spacing remains unchanged. 

If such infinitely stretchable glue is placed at the left of a row of boxes, 
the effect is to place them “flush right,” i.e., to move them over to the rightmost 
boundary of the constructed box. And if you take two globs of infinitely stretch- 
able glue, putting one at the left and one at the right, the effect is to center the 
list of boxes within a larger box. This in fact is how the \centerline instruction 
works in plain TgX: It places infinite glue at both ends, then makes a box whose 
width is the current value of \hsize. 

The short story example of Chapter 6 used infinite glue not only for 
centering, but also in the \vfill instruction at the end; ‘\vfill’ essentially 
means “skip vertically by zero, but with infinite stretcliability.” In other words, 
\vf ill fills up the rest of the current page with blank space. 

f TjgX actually recognizes several kinds of infinity, some of which are “more 
infinite” than others. You can say both \vfil and \vfill; the second is 
stronger than the first. In other words, if no other infinite stretcliability is present, 
\vf il will expand to fill the remaining space; but if both \vf il and \vf ill are present 
simultaneously, the \vfill effectively prevents \vfil from stretching. You can think 
of it as if \vfil has one mile of stretcliability, while \vfill has a trillion miles. 

f Besides \vfil and \vfill, TjrX has \hfil and \hfill, for stretching indefi- 
nitely in the horizontal direction. You can also say \hss or \vss, in order to 
get glue that is infinitely shrinkable as well as infinitely stretchable. (The name ‘\hss’ 
stands for “horizontal stretch or shrink”; ‘\vss’ is its vertical counterpart.) Finally, the 
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primitives \hfilneg and \vfilneg will cancel the stretchability of \hfil and \vfil; 
we shall discuss applications of these curious glues later. 




Here are some examples of \hfil, using the Mine macro of plain TpX, which 
creates an hbox whose width is the current \hsize: 

\line{This text will be flush left.\hfil} 

\line{\hfil This text will be flush right.} 

\line{\hfil This text will be centered. \hfil} 

\line{Some text flush left\hfil and some flush right.} 
\line{Alpha\hf il centered between Alpha and Omega\hfil Omega} 
\line{Five\hf il words\hfil equally\hfil spaced\hfil out.} 

► EXERCISE 12.2 

Describe the result of 



\line{\hf il\hf il What happens now?\hfil} 
\line{\hf ill\hf il and now?\hfil} 

► EXERCISE 12.3 

How do the following three macros behave differently? 


\def \ center linea#l{\line{\hf il#l\hf il}} 
\def\centerlineb#l{\line{\hf ill#l\hf ill}} 
\def \ cent er line c#l{\line{\hss#l\hss}} 


f in order to specify such infinities, you are allowed to use the special units ‘f il’, 
‘fill’, and ‘filll’ in the (dimen) parts of a stretchability or shrinkability 
component. For example, \vf il, \vf ill, \vss, and \vf ilneg are essentially equivalent 
to the glue specifications 


\vskip Opt plus lfil 
\vskip Opt plus lfill 
\vskip Opt plus lfil minus lfil 
\vskip Opt plus -lfil 


respectively. It’s usually best to stick to the first order infinity (fil) as much as you can, 
resorting to second order (fill) only when you really need something extremely infinite. 
Then the ultimate order (filll) is always available as a last resort in emergencies. (TgX 
does not provide a ‘\vfilll’ primitive, since the use of this highest infinity is not 
encouraged.) You can use fractional multiples of infinity like ‘3.25fil’, as long as 
you stick to fewer than 16384 fil units. TpX actually does its calculations with integer 
multiples of 2 — 16 fil (or fill or filll); so 0. 000007filll turns out to be indistinguishable 
from Opt, but O.OOOOlfilll is infinitely greater than 16383. 99999f ill. 


Now 7 here’s something important for all T^Xmcal typists to know: Plain 
TgX puts extra space at the end of a sentence; furthermore, it automatically 
increases the stretchability (and decreases the shrinkability) after punctuation 
marks. The reason is that it’s usually better to put more space after punctua- 
tion than between two ordinary words, when spreading a line out to reach the 
desired margins. Consider, for example, the following sentences from a classic 
kindergarten pre-primer: 


hfilneg 

vfilneg 

line 

flush left 

fil 

fill 

filll 

vfilll 

sentence 

punctuation 

Dick and Jane 


“Oh, oh! ’ ’ 


cried Baby Sally. Dick and Jane laughed. 
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If TpX sets this at its natural width, all the spaces will be the same, except after 
the quote and after ‘Baby Sally.’: 

“Oh, oh!” cried Baby Sally. Dick and Jane laughed. 

But if the line needs to be expanded by 5 points, 10 points, 15 points, or more, 
TgX will set it as 

“Oh, oh!” cried Baby Sally. Dick and Jane laughed. 

“Oh, oh!” cried Baby Sally. Dick and Jane laughed. 

“Oh, oh!” cried Baby Sally. Dick and Jane laughed. 

“Oh, oh!” cried Baby Sally. Dick and Jane laughed. 

The glue after the comma stretches at 1.25 times the rate of the glue between 
adjacent words; the glue after the period and after the ! ’ ’ stretches at 3 times 
the rate. There is no glue between adjacent letters, so individual words will 
always look the same. If Ff'X had to shrink this line to its minimum width, the 
result would be 

“Oh, oh!” cried Baby Sally. Dick and Jane laughed. 

The glue after a comma shrinks only 80 percent as much as ordinary inter-word 
glue, and after a period or exclamation point or question mark it shrinks by only 
one third as much. 

This all makes for nice-looking output, but it unfortunately adds a bit 
of a burden to your job as a typist, because Tj^X’s rule for determining the end of 
a sentence doesn’t always work. The problem is that a period sometimes comes 
in the middle of a sentence . . . like when it is used (as here) to make an “ellipsis” 
of three dots. 

Moreover, if you try to specify ‘. . .’ by typing three periods in a row, 
you get ‘...’ — the dots are too close together. One way to handle this is to go 
into mathematics mode, using the \ldots control sequence defined in plain T^X 
format. For example, if you type 

Hmmm $\ldots$ I wonder why? 

the result is ‘Hmmm ... I wonder why?’. This works because math formulas are 
exempt from the normal text spacing rules. Chapter 18 has more to say about 
\ldots and related topics. 

Abbreviations present problems too. For example, the short story in 
Chapter 6 referred to ‘Mr. Drofnats’; TgX must be told somehow that the period 
after ‘Mr.’ or ‘Mrs.’ or ‘Ms.’ or ‘Prof.’ or ‘Dr.’ or ‘Rt. Hon.’, etc., doesn’t count 
as a sentence-ending full stop. 

We avoided that embarrassment in Chapter 6 by typing ‘Mr . "Drofnats’; 
the “tie” mark ~ tells plain TgX to insert a normal space, and to refrain from 
breaking between lines at that space. Another way to get TgX to put out a 
normal space is to type ‘\ u ’ (control space); e.g., ‘Mr.\ Drofnats’ would be 
almost the same as ‘Mr. "Drofnats’, except that a line might end after the ‘Mr.’. 
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The tie mark is best for abbreviations within a name, and after several 
other common abbreviations like ‘Fig.’ and ‘cf.’ and ‘vs.’ and ‘resp.’; you will 
find that it’s easy to train yourself to type ‘cf . ~Fig. ~5’. In fact, it’s usually 
wise to type ~ (instead of a space) just after a common abbreviation that occurs 
in the middle of a sentence. Manuals of style will tell you that the abbreviations 
‘e.g.’ and ‘i.e. ’ should always be followed by commas, never by spaces, so those 
particular cases shouldn’t need any special treatment. 

The only remaining abbreviations that arise with significant frequency 
occur in bibliographic references; control spaces are appropriate here. If, for 
example, you are typing a manuscript that refers to ‘Proc. Amer. Math. Soc.’, 
you should say 

Proc.\ Amer.\ Math.\ Soc. 


control spaces 

interword spacing 

User 

Drofnats 

Knuth 

frenchspacing 
nonfrenchspacing 
sophisticated spacing 
Presume 


Granted that this input looks a bit ugly, it makes the output look right. It’s one 
of the things we occasionally must do when dealing with a computer that tries 
to be smart. 


► EXERCISE 12.4 

Explain how to type the following sentence: “Mr. & Mrs. User were married by 
Rev. Drofnats, who preached on Matt. 19 : 3-9.” 


► EXERCISE 12.5 

Put the following bibliographic reference into plain TgX language: Donald E. 
Knuth, “Mathematical typography,” Bull. Amer. Math. Soc. 1 (1979), 337-372. 

On the other hand, if you don’t care about such refinements of spacing 
you can tell plain TgX to make all spaces the same, regardless of punctuation 
marks, by simply typing ‘\frenchspacing’ at the beginning of your manuscript. 
French spacing looks like this: 

“Oh, oh!” cried Baby Sally. Dick and Jane laughed. 


You can also shift back and forth between the two styles, either by saying 
‘\nonfrenchspacing’ to establish sophisticated spacing, or by making your use 
of \frenchspacing local to some group. For example, you might want to use 
French spacing only when typing the bibliography of some document. 

f TpX doesn’t consider a period or question mark or exclamation point to be 
the end of a sentence if the preceding character is an uppercase letter, since 
JRX assumes that such uppercase letters are most likely somebody’s initials. Thus, for 
example, the ‘V is unnecessary after the ‘I.’ in ‘Dr . ~Livingstone~I . \ Presume’; that 
particular period is not assumed to be a full stop. 


/■£> ►EXERCISE 12.6 

J_ What can you do to make JRX recognize the ends of sentences that do end with 
uppercase letters (e.g., ‘. . . launched by NASA.’ or ‘Did I?’ or ‘. . . see Appendix A.’)? 
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You can see the glue that TpX puts between words by looking at the contents 
of hboxes in the internal diagnostic format that we discussed briefly in Chap- 
ter 11. For example, Baby Sally’s exclamation begins as follows, after IgX has digested 
it and put it into a box, assuming \nonfrenchspacing: 

. \tenrm \ (ligature ‘‘) 

. \tenrm 0 
. \tenrm h 
. \tenrm , 

. \glue 3.33333 plus 2.08331 minus 0.88889 
. \tenrm o 
. \tenrm h 
. \tenrm ! 

.\tenrm " (ligature ’’) 

. \glue 4.44444 plus 4.99997 minus 0.37036 
. \tenrm c 
. \tenrm r 
. \tenrm i 
. \tenrm e 
. \tenrm d 

. \glue 3.33333 plus 1.66666 minus 1.11111 
. \tenrm B 
. \tenrm a 
. \tenrm b 
.\kern-0. 27779 
. \tenrm y 

. \glue 3.33333 plus 1.66666 minus 1.11111 
. \tenrm S 
. \tenrm a 
. \tenrm 1 
. \tenrm 1 
. \tenrm y 
.\kern-0. 83334 
. \tenrm . 

. \glue 4.44444 plus 4.99997 minus 0.37036 

The normal interword glue in font \tenrm is 3.33333 pt, plus 1.66666 pt of stretcliability, 
minus 1.11111 pt of shrinkability. Notice that the interword \glue in this list stretches 
more, and shrinks less, after the punctuation marks; and the natural space is in fact 
larger at the end of each sentence. This example also shows several other things that 
TjrX does while it processes the sample line of text: It converts ‘ ‘ and ’ ’ into single 
characters, i.e. , ligatures; and it inserts small kerns in two places to improve the spacing. 
A \kern is similar to glue, but it is not the same, because kerns cannot stretch or shrink; 
furthermore, TJrX will never break a line at a kern, unless that kern is immediately 
followed by glue. 

/>)/>) You may be wondering what T^rX’s rules for interword glue really are, exactly. 
JL JL For example, how did remember the effect of Baby Sally’s exclamation 
point, when quotation marks intervened before the next space? The details are slightly 
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tricky, but not incomprehensible. When TjgX is processing a horizontal list of boxes 
and glue, it keeps track of a positive integer called the current “space factor.” The 
space factor is normally 1000, which means that the interword glue should not be 
modified. If the space factor / is different from 1000, the interword glue is computed 
as follows: Take the normal space glue for the current font, and add the extra space 
if / > 2000. (Each font specifies a normal space, normal stretch, normal shrink, 
and extra space; for example, these quantities are 3.33333 pt, 1.66666 pt, 1.11111 pt, 
and 1.11111 pt, respectively, in cmrlO. We’ll discuss such font parameters in greater 
detail later.) Then the stretch component is multiplied by // 1000, while the shrink 
component is multiplied by 1000//. 

/>)/>) However, TjgX has two parameters \spaceskip and \xspaceskip that allow 
JL JL you to override the normal spacing of the current font. If / > 2000 and if 
\xspaceskip is nonzero, the \xspaceskip glue is used for an interword space. Other- 
wise if \spaceskip is nonzero, the \spaceskip glue is used, with stretch and shrink 
components multiplied by //1000 and 1000//. For example, the \raggedright macro 
of plain TjgX uses \spaceskip and \xspaceskip to suppress all stretching and shrinking 
of interword spaces. 

/gb/gb The space factor / is 1000 at the beginning of a horizontal list, and it is set to 
JL JL 1000 just after a non-character box or a math formula has been put onto the 
current horizontal list. You can say l \spacefactor=(number)’ to assign any particular 
value to the space factor; but ordinarily, / gets set to a number other than 1000 only 
when a simple character box goes on the list. Each character has a space factor code, 
and when a character whose space factor code is g enters the current list the normal 
procedure is simply to assign g as the new space factor. However, if g is zero, / is not 
changed; and if / < 1000 < g, the space factor is set to 1000. (In other words, / 
doesn’t jump from a value less than 1000 to a value greater than 1000 in a single step.) 
The maximum space factor is 32767 (which is much higher than anybody would ever 
want to use). 

/gb/gb When INITEX creates a brand new TgX, all characters have a space factor code 
JL JL of 1000, except that the uppercase letters ‘A’ through ‘Z’ have code 999. (This 
slight difference is what makes punctuation act differently after an uppercase letter; do 
you see why?) Plain TjgX redefines a few of these codes using the \sfcode primitive, 
which is similar to \catcode (see Appendix B); for example, the instructions 


\sf code ‘ ) =0 


\sf code ‘ . =3000 


make right parentheses “transparent” to the space factor, while tripling the stretclia- 
bility after periods. The \frenchspacing operation resets \sfcode‘ . to 1000. 

When ligatures are formed, or when a special character is specified via \char, 
JL JL the space factor code is computed from the individual characters that gener- 
ated the ligature. For example, plain T^X sets the space factor code for single-right- 
quote to zero, so that the effects of punctuation will be propagated. Two adjacent 
characters ’’ combine to form a ligature that is in character position ' 042 ; but the 
space factor code of this double-riglit-quote ligature is never examined by TjgX, so plain 
TjgX does not assign any value to \sf code’ 042. 

<§)<§)*■ EXERCISE 12.7 

What are the space factors after each token of the Dick-and-Jane example? 
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f Here’s the way TjrjX goes about setting the glue when an hbox is being wrapped 
up: The natural width, x, of the box contents is determined by adding up the 
widths of the boxes and kerns inside, together with the natural widths of all the glue 
inside. Furthermore the total amount of glue stretchability and shrinkability in the 
box is computed; let’s say that there’s a total of y o + y 1 fil + y -2 fill + ys filll available 
for stretching and so + si fil + S 2 fill + S 3 filll available for shrinking. Now the natural 
width x is compared to the desired width w. If x = in, all glue gets its natural width. 
Otherwise the glue will be modified, by computing a “glue set ratio” r and a “glue set 
order” i in the following way: (a) I f x < w. TjsX attempts to stretch the contents of 
the box; the glue order is the highest subscript i such that yt is nonzero, and the glue 
ratio is r = (w — x)/yi. (If yo = yi = yi = i/3 = 0, there’s no stretchability; both i 
and r are set to zero.) (b) If x > w, TjrX attempts to shrink the contents of the box 
in a similar way; the glue order is the highest subscript i such that ^ 0, and the 
glue ratio is normally r = (x — w)/s{. However, r is set to 1.0 in the case i = 0 and 
x — w > zo, because the maximum shrinkability must not be exceeded, (c) Finally, 
every glob of glue in the horizontal list being boxed is modified. Suppose the glue has 
natural width u, stretchability y, and shrinkability z, where y is a jth order infinity 
and s is a kth order infinity. Then if x < w (stretching), this glue takes the new width 
u. + ry if j = i; it keeps its natural width u if j 7 ! i. If x > w (shrinking), this glue 
takes the new width u — rz if k = i\ it keeps its natural width u if k ^ i. Notice that 
stretching or shrinking occurs only when the glue has the highest order of infinity that 
doesn’t cancel out. 

f TjrX will construct an hbox that has a given width w if you issue the command 
‘\hbox to (dimen) {{contents of box)}’, where w is the value of the {dimen). 
For example, the Mine macro discussed earlier in this chapter is simply an abbreviation 
for ‘\hbox to\hsize’. TjgX also allows you to specify the exact amount of stretching 
or shrinking; the command ‘\hbox spread{dimen){ {contents of box)}’ creates a box 
whose width w is a given amount more than the natural width of the contents. For 
example, one of the boxes displayed earlier in this chapter was generated by 


\hbox spread 5pt{‘‘0h, oh!’’ 


laughed.} 


In the simplest case, when you just want a box to have its natural width, you don’t 
have to write ‘\hbox spread Opt’; you can simply say ‘\hbox{ {contents of box)}’. 

f The baseline of a constructed hbox is the common baseline of the boxes inside. 

(More precisely, it’s the common baseline that they would share if they weren’t 
raised or lowered.) The height and depth of a constructed hbox are determined by the 
maximum distances by which the interior boxes reach above and below the baseline, 
respectively. The result of \hbox never has negative height or negative depth, but the 
width can be negative. 

►EXERCISE 12.8 

JL Assume that \boxl is 1 pt high, 1 pt deep, and 1 pt wide; \box2 is 2 pt high, 
2 pt deep, and 2 pt wide. A third box is formed by saying 

\setbox3=\hbox to3pt{\hf il\lower3pt\boxl\hskip-3pt plus3f il\box2} 

What are the height, depth, and width of \box3? Describe the position of the reference 
points of boxes 1 and 2 with respect to the reference point of box 3. 
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The process of setting glue for vboxes is similar to that for hboxes; but. before 
we study the \vbox operation, we need to discuss how TgX stacks boxes up 
vertically so that their baselines tend to be a fixed distance apart. The boxes in a 
horizontal list often touch each other, but it’s usually wrong to do this in a vertical list; 
imagine how awful a page would look if its lines of type were brought closer together 
whenever they didn’t contain tall letters, or whenever they didn’t contain any letters 
that descended below the baseline. 



TJrX’s solution to this problem involves three primitives called \baselineskip, 
\lineskip, and \lineskiplimit. A format designer chooses values of these 
three quantities by writing 



\baselineskip={glue) 

\lineskip={glue) 

Mine skiplimit={dimen) 


and the interpretation is essentially this: Whenever a box is added to a vertical list, T[-.X 
inserts “interline glue” intended to make the distance between the baseline of the new 
box and the baseline of the previous box exactly equal to the value of \baselineskip. 
But if the interline glue calculated by this rule would cause the top edge of the new 
box to be closer than Mineskiplimit to the bottom edge of the previous box, then 
\lineskip is used as the interline glue. In other words, the distance between adjacent 
baselines will be the \baselineskip setting, unless that would bring the boxes too 
close together; the \lineskip glue will separate adjacent boxes in the latter case. 

f Tlie rules for interline glue in the previous paragraph are carried out without 
regard to other kinds of glue that might be present; all vertical spacing due 
to explicit appearances of \vskip and \kern acts independently of the interline glue. 
Thus, for example, a \smallskip between two lines always makes their baselines further 
apart than usual, by the amount of a \smallskip; it does not affect the decision about 
whether \lineskip glue is used between those lines. 

f For example, let’s suppose that \baselineskip=12pt plus 2pt, \lineskip= 
3pt minus lpt, and \lineskiplimit=2pt. (These values aren’t particularly 
useful; they have simply been chosen to illustrate the rules.) Suppose further that a 
box whose depth is 3 pt was most recently added to the current vertical list; we are 
about to add a new box whose height is h. If h = 5 pt, the interline glue will be 
4 pt plus 2pt, since this will make the baselines 12 pt plus 2pt apart when we add h 
and the previous depth to the interline glue. But if h = 8pt, the interline glue will 
be 3pt minus lpt, since \lineskip will be chosen in order to keep from violating the 
given Mineskiplimit when stretching and shrinking are ignored. 

f Wlien you are typesetting a document that spans several pages, it’s generally 
best to define the \baselineskip so that it cannot stretch or shrink, because 
this will give more uniformity to the pages. A small variation in the distance between 
baselines — say only half a point — can make a substantial difference in the appearance 
of the type, since it significantly affects the proportion of white to black. On the 
other hand, if you are preparing a one-page document, you might want to give the 
baselineskip some stretchability, so that TjjX will help you fit the copy on the page. 


baselineskip 
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interline glue 
smallskip 


► EXERCISE 12.9 

What settings of \baselineskip, \lineskip, and \lineskiplimit will cause 
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the interline glue to be a “continuous” function of the next box height (i.e. , the interline 
glue will never change a lot when the box height changes only a little)? 


f A study of TJtX’s internal box-and-glue representation should help to firm 
up some of these ideas. Here is an excerpt from the vertical list that TgX 
constructed when it was typesetting this very paragraph: 


\glue 6.0 plus 2.0 minus 2.0 
\glue (\parskip) 0.0 plus 1.0 
\glue (\baselineskip) 1.25 

\hbox (7.5+1. 93748) x312 . 0, glue set 0.80154, shifted 36.0 [] 
\penalty 10000 

\glue (\baselineskip) 2.81252 

\hbox (6 . 25+1 . 93748) x312 . 0 , glue set 0.5816, shifted 36.0 [] 
\penalty 50 

\glue (\baselineskip) 2.81252 

\hbox (6.25+1. 75)x348. 0, glue set 116.70227fil [] 

\penalty 10000 

\glue (\abovedisplayskip) 6.0 plus 3.0 minus 1.0 

\glue (\lineskip) 1.0 

\hbox( 149. 25+0. 74998) x348.0 [] 


The first \glue in this example is the \medskip that precedes each dangerous-bend 
paragraph. Then comes the \parskip glue, which is automatically supplied before 
the first line of a new paragraph. Then comes some interline glue of 1.25 pt; it was 
calculated to make a total of 11 pt when the height of the next box (7.5 pt) and the 
depth of the previous box were added. (The previous box is not shown — it’s the 
bottom line of exercise 12.9 — but we can deduce that its depth was 2.25 pt.) The 
\hbox that follows is the first line of this paragraph; it has been shifted right 36 pt 
because of hanging indentation. The glue set ratio for this hbox is 0.80154; i.e., the 
glue inside is stretched by 80.154% of its stretchability. (In the case of shrinking, 
the ratio following ‘glue set’ would have been preceded by ’; hence we know that 
stretching is involved here. ) Tj?X has put 1 [] ’ at the end of each hbox line to indicate 
that there’s something in the box that isn’t shown. (The box contents would have 
been displayed completely, if \showboxdepth had been set higher.) The \penalty 
indications are used to discourage bad breaks between pages, as we will see later. The 
third hbox has a glue ratio of 116.70227, which applies to first-order-infinite stretching 
(i.e., fil); this results from an \hfil that was implicitly inserted just before the displayed 
material, to fill up the third line of the paragraph. Finally the big hbox whose height 
is 149.25 pt causes \lineskip to be the interline glue. This large box contains the 
individual lines of typewriter type that are displayed; they have been packaged into a 
single box so that they cannot be split between pages. Careful study of this example 
will teach you a lot about IjgX’s inner workings. 



Exception: No interline glue is inserted before or after a rule box. You can 
also inhibit interline glue by saying \nointerlineskip between boxes. 


/>)/>) TeX’s implementation of interline glue involves another primitive quantity 
JL JL called \prevdepth, which usually contains the depth of the most recent box 
on the current vertical list. However, \prevdepth is set to the sentinel value — 1000 pt 


internal box-and-glue representat 

medskip 

parskip 

hanging indentation 

glue set 

showboxdepth 

penalty 

nointerlineskip 

prevdepth 



80 Chapter 12: Glue 


at the beginning of a vertical list, or just after a rule box; this serves to suppress the 
next interline glue. The user can change the value of \prevdepth at any time when 
building a vertical list; thus, for example, the \nointerlineskip macro of Appendix B 
simply expands to ‘\prevdepth=-1000pt’. 

Here are the exact rules by which TjgX calculates the interline glue between 
JL JL boxes: Assume that a new box of height h (not a rule box) is about to 
be appended to the bottom of the current vertical list, and let \prevdepth = p, 
\lineskiplimit = l, \baselineskip = (b plus y minus z). If p < — 1000 pt, no in- 
terline glue is added. Otherwise if b — p — h > l, the interline glue ‘(b — p — h) plus y 
minus z' will be appended just above the new box. Otherwise the \lineskip glue will 
be appended. Finally, \prevdepth is set to the depth of the new box. 

EXERCISE 12.10 

JL JL Mr. B. L. User had an application in which he wanted to put a number of 
boxes together in a vertical list, with no space between them. He didn’t want to say 
\nointerlineskip after each box; so he decided to set \baselineskip, \lineskip, and 
\lineskiplimit all equal to Opt. Did this work? 

f The vertical analog of \hbox is \vbox, and Ij^X will obey the commands ‘\vbox 
to(dimen)’ and ‘\vbox spread(dimen)’ in about the way you would expect, 
by analogy with the horizontal case. However, there’s a slight complication because 
boxes have both height and depth in the vertical direction, while they have only width 
in the horizontal direction. The dimension in a \vbox command refers to the final 
height of the vbox, so that, for example, ‘\vbox to 50pt{. . .}’ produces a box that 
is 50 pt high; this is appropriate because everything that can stretch or shrink inside a 
vbox appears in the part that contributes to the height, while the depth is unaffected 
by glue setting. 

f The depth of a constructed \vbox is best thought of as the depth of the bottom 
box inside. Thus, a vbox is conceptually built by taking a bunch of boxes and 
arranging them so that their reference points are lined up vertically; then the reference 
point of the lowest box is taken as the reference point of the whole, and the glue is set 
so that the final height has some desired value. 

f However, this description of vboxes glosses over some technicalities that come 
up when you consider unusual cases. For example, TgX allows you to shift 
boxes in a vertical list to the right or to the left by saying l \moveright(dimen) (box)’ 
or ‘\moveleft(dimen) (box)’; this is like the ability to \raise or Xlower boxes in a 
horizontal list, and it implies that the reference points inside a vbox need not always 
lie in a vertical line. Furthermore, it is necessary to guard against boxes that have 
too much depth, lest they extend too far into the bottom margin of a page; and later 
chapters will point out that vertical lists can contain other things like penalties and 
marks, in addition to boxes and glue. 

Therefore, the actual rules for the depth of a constructed vbox are somewhat 
JL JL Tf?,Xmcal. Here they are: Given a vertical list that is being wrapped up via 
\vbox, the problem is to determine its natural depth. (1) If the vertical list contains no 
boxes, the depth is zero. (2) If there’s at least one box, but if the final box is followed 
by kerning or glue, possibly with intervening penalties or other things, the depth is zero. 
(3) If there’s at least one box, and if the final box is not followed by kerning or glue, 
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the depth is the depth of that box. (4) However, if the depth computed by rules (1), 
(2), or (3) exceeds \boxmaxdepth, the depth will be the current value of \boxmaxdepth. 
(Plain TjrX sets \boxmaxdepth to the largest possible dimension; therefore rule (4) 
won’t apply unless you specify a smaller value. When rule (4) does decrease the depth, 
TjgX adds the excess depth to the box’s natural height, essentially moving the reference 
point down until the depth has been reduced to the stated maximum.) 

f Tlre glue is set in a vbox just as in an lrbox, by determining a glue set ratio 
and a glue set order, based on the difference between the natural height x and 
the desired height w, and based on the amounts of stretchability and shrinkability that 
happen to be present. 

f Tlre width of a computed \vbox is the maximum distance by which an enclosed 
box extends to the right of the reference point, taking possible shifting into 
account. This width is always nonnegative. 

►EXERCISE 12.11 

JL Assume that \boxl is 1 pt high, 1 pt deep, and 1 pt wide; \box2 is 2 pt high, 
2pt deep, and 2pt wide; the baselineskip, lineskip, and lineskiplimit are all zero; and 
the \boxmaxdepth is very large. A third box is formed by saying 

\setbox3=\vbox to3pt{\moveright3pt\boxl\vskip-3pt plus3f il\box2} 

What are the height, depth, and width of \box3? Describe the position of the reference 
points of boxes 1 and 2 with respect to the reference point of box 3. 

EXERCISE 12.12 

JL JL Under the assumptions of the previous exercise, but with \baselineskip=9pt 
minus3fil, describe \box4 after 

\setbox4=\vbox to4pt{\vss\boxl\moveleft4pt\box2\vss} 

/>,/£,► EXERCISE 12.13 

JL JL Solve the previous problem but with \boxmaxdepth=-4pt. 

f We have observed that \vbox combines a bunch of boxes into a larger box that 
lias the same baseline as the bottom box inside. Tf-'X lias another operation 
called \vtop, which gives you a box like \vbox but with the same baseline as the top 
box inside. For example, 


\hbox{Here are \vtop{\hbox{two lines}\hbox{of text.}}} 


produces 


Here are two lines 
of text. 

/>)/>) You can say ‘\vtop to(dimen)’ and ‘\vtop spread(dimen)’ just as with \vbox, 
JL JL but you should realize what such a construction means. TjjX implements \vtop 
as follows: (1) First a vertical box is formed as if \vtop had been \vbox, using all of 
the rules for \vbox as given above. (2) The final height x is defined to be zero unless 
the very first item inside the new vbox is a box; in the latter case, x is the height of 
that box. (3) Let h and d be the height and depth of the vbox in step (1). TgX 
completes the \vtop by moving the reference point up or down, if necessary, so that 
the box has height x and depth h + d — x. 


boxmaxdepth 
glue set ratio 
glue set order 
vtop 



82 Chapter 12: Glue 


/>)/>)► EXERCISE 12.14 

JL JL Describe the empty boxes that you get from ‘\vbox to{dimen){}’ and ‘\vtop 
to{dimen){}’. What are their heights, depths, and widths? 

EXERCISE 12.15 

JL JL Define a macro \nullbox#l#2#3 that produces a box whose height, depth, and 
width are given by the three parameters. The box should contain nothing that will 
show up in print. 

f Tlie \vbox operation tends to produce boxes with large height and small depth, 
while \vtop tends to produce small height and large depth. If you’re trying 
to make a vertical list out of big vboxes, however, you may not be satisfied with either 
\vbox or \vtop; you might well wish that a box had two reference points simultaneously, 
one for the top and one for the bottom. If such a dual-reference-point scheme were in 
use, one could define interline glue based on the distance between the lower reference 
point of one box and the upper reference point of its successor in a vertical list. But 
alas, TjgX gives you only one reference point per box. 

f There’s a way out of this dilemma, using an important idea called a “strut.” 

Plain TjgX defines \ strut to be an invisible box of width zero that extends 
just enough above and below the baseline so that you would need no interline glue at 
all if every line contained a strut. (Baselines are 12 pt apart in plain IjgX; it turns 
out that \strut is a vertical rule, 8.5 pt high and 3.5 pt deep and Opt wide.) If you 
contrive to put a strut on the top line and another on the bottom line, inside your large 
vboxes, then it’s possible to obtain the correct spacing in a larger assembly by simply 
letting the boxes butt together. For example, the \footnote macro in Appendix B 
puts struts at the beginning and end of every footnote, so that the spacing will be right 
when several footnotes occur together at the bottom of some page. 

f lf you understand boxes and glue, you’re ready to learn the \rlap and \llap 
macros of plain TjgX; these names are abbreviations for “right overlap” and 
“left overlap.” Saying ‘\rlap{ (something)}’ is like typesetting (something) and then 
backing up as if you hadn’t typeset anything. More precisely, ‘\rlap{ (something)}’ 
creates a box of width zero, with ‘(something)’ appearing just at the right of that 
box (but not taking up any space). The \llap macro is similar, but it does the 
backspacing first; in other words, ‘\llap{ (something)}’ creates a box of width zero, 
with ‘(something)’ extending just to the left of that box. Using typewriter type, for 
example, you can typeset ‘C by saying either ‘\rlap/=’ or ‘/\llap=’. It’s possible to 
put text into the left margin using \llap, or into the right margin using \rlap, because 
IjgX does not insist that the contents of a box must be strictly confined within that 
box’s boundaries. 



The interesting thing about \rlap and \llap is that they can be done so 
simply with infinite glue. One way to define \rlap would be 

\def\rlap#l{{\setboxO=\hbox{#l}\copyO\kern-\wdO}} 


but there’s no need to do such a lengthy computation. The actual definition in Appen- 
dix B is much more elegant , namely, 
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\def \rlap#l{\hbox to Opt{#l\hss}} 
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and it’s worth pondering why this works. Suppose, for example, that you’re doing 
\rlap{g} where the letter ‘g’ is 5pt wide. Since \rlap makes an libox of width Opt, 
the glue represented by \hss must shrink by 5 pt. Well, that glue has 0 pt as its natural 
width, but it has infinite shrinkability, so it can easily shrink to —5 pt; and ‘\hskip-5pt’ 
is exactly what \rlap wants in this case. 



► EXERCISE 12.16 

Guess the definition of \llap, without peeking at Appendices A or B. 

► EXERCISE 12.17 

(This is a sequel to exercise 12.2, but it’s trickier.) Describe the result of 


\line{\hfil A puzzle . \hfilneg} 


hss 

TWAIN 

Clemens 

TSCHICHOLD 


There was things which he stretched, 
but mainly he told the truth. 
— MARK TWAIN, Huckleberry Finn (1884) 

Every shape exists only because of the space around it. 
. . . Hence there is a 'right' position for every shape in every situation. 

If we succeed in finding that position, we have done our job. 
— JAN TSCHICHOLD, Typographische Gestaltung (1935) 
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Just as people get into different moods, TgX gets into different “modes.” (Except 
that T^X is more predictable than people.) There are six modes: 

■ Vertical mode. [Building the main vertical list, from which the pages of 
output are derived.] 

■ Internal vertical mode. [Building a vertical list for a vbox.] 

■ Horizontal mode. [Building a horizontal list for a paragraph.] 

■ Restricted horizontal mode. [Building a horizontal list for an hbox.] 

■ Math mode. [Building a mathematical formula to be placed in a hori- 
zontal list.] 

■ Display math mode. [Building a mathematical formula to be placed on 
a line by itself, temporarily interrupting the current paragraph.] 

In simple situations, you don’t need to be aware of what mode TgX is in, because 
the computer just does the right thing. But when you get an error message 
that says 1 ! You can’t do such-and-such in restricted horizontal mode’, 
a knowledge of modes helps to explain why TgX thinks you goofed. 

Basically T^X is in one of the vertical modes when it is preparing a list 
of boxes and glue that will be placed vertically above and below one another on 
the page; it’s in one of the horizontal modes when it is preparing a list of boxes 
and glue that will be strung out horizontally next to each other with baselines 
aligned; and it’s in one of the math modes when it is reading a formula. 

A play-by-play account of a typical T^X job should make the mode idea 
clear: At the beginning, TjrpC is in vertical mode, ready to construct pages. If 
you specify glue or a box when T^X is in vertical mode, the glue or the box 
gets placed on the current page below what has already been specified. For 
example, the \vskip instructions in the sample run we discussed in Chapter 6 
contributed vertical glue to the page; and the \hrule instructions contributed 
horizontal rules at the top and bottom of the story. The \centerline commands 
also produced boxes that were included in the main vertical list; but those boxes 
required a bit more work than the rule boxes: TgX was in vertical mode when 
it encountered ‘\centerline{\bf A SHORT STORY}’, and it went temporarily 
into restricted horizontal mode while processing the words ‘A SHORT STORY’; 
then the digestive process returned to vertical mode, after setting the glue in 
the \centerline box. 

Continuing with the example of Chapter 6, TgpC switched into horizontal 
mode as soon as it read the ‘0’ of ‘Once upon a time’. Horizontal mode is the 
mode for making paragraphs. The entire paragraph (lines 7 to 11 of the story 
file) was input in horizontal mode; then the text was divided into output lines of 
the appropriate width, those lines were put in boxes and appended to the page 
(with appropriate interline glue between them), and TgX was back in vertical 
mode. The ‘M’ on line 12 started up horizontal mode again. 

When Tf^X is in vertical mode or internal vertical mode, the first token of 
a new paragraph changes the mode to horizontal for the duration of a paragraph. 
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In other words, things that do not have a vertical orientation cause the mode to 
switch automatically from vertical to horizontal. This occurs when you type any 
character, or \char or \accent or \hskip or \ u or \vrule or math shift ($); 
T^X inserts the current paragraph indentation and rereads the horizontal token 
as if it had occurred in horizontal mode. 


f You can also tell TI 5 X explicitly to go into horizontal mode, instead of relying 
on such implicit mode-switching, by saying ‘\indent’ or ‘\noindent’. For 
example, if line 7 of the story file in Chapter 6 had begun 


\indent Once upon a time, ... 

the same output would have been obtained, because ‘\indent’ would have instructed 
TTpC to begin the paragraph. And if that line had begun with 

\noindent Once upon a time, ... 


the first paragraph of the story would not have been indented. The \no indent com- 
mand simply tells TgX to enter horizontal mode if the current mode is vertical or 
internal vertical; \indent is similar, but it also creates an empty box whose width is 
the current value of \parindent, and it puts this empty box into the current horizontal 
list. Plain TgX sets \parindent=20pt. If you say \indent\indent, you get double 
indentation; if you say \noindent\noindent, the second \noindent does nothing. 

► EXERCISE 13.1 

If you say ‘\hbox{. . .}’ in horizontal mode, T^rX will construct the specified 
box and it will contribute the result to the current paragraph. Similarly, if you say 
‘\hbox{ . . . }’ in vertical mode, Tf.'X will construct a box and contribute it to the current 
page. What can you do if you want to begin a paragraph with an \hbox? 



When handling simple manuscripts, TgX spends almost all of its time in 
horizontal mode (making paragraphs), with brief excursions into vertical mode 
(between paragraphs). A paragraph is completed when you type \par or when 
your manuscript has a blank line, since a blank line is converted to \par by 
the reading rules of Chapter 8. A paragraph also ends when you type certain 
things that are incompatible with horizontal mode. For example, the command 
‘\vskip lin’ on line 16 of Chapter 6’s story file was enough to terminate the 
paragraph about ‘...beautiful documents.’; no \par was necessary, since 
\vskip introduced vertical glue that couldn’t belong to the paragraph. 

If a begin-math token ($) appears in horizontal mode, TgX plunges into 
math mode and processes the formula up until the closing then appends the 
text of this formula to the current paragraph and returns to horizontal mode. 
Thus, in the “I wonder why?” example of Chapter 12, TgX went into math mode 
temporarily while processing \ldots, treating the dots as a formula. 

However, if two consecutive begin-math tokens appear in a paragraph 
($$), TgX interrupts the paragraph where it is, contributes the paragraph-so-far 
to the enclosing vertical list, then processes a math formula in display math 
mode, then contributes this formula to the enclosing list, then returns to hori- 
zontal mode for more of the paragraph. (The formula to be displayed should 
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end with For example, suppose you type 

the number $$\pi \approx 3.1415926536$$ is important. 

TgX goes into display math mode between the $$’s, and the output you get 
states that the number 

t r « 3.1415926536 

is important. 

TgX ignores blank spaces and blank lines (or \par commands) when it’s 
in vertical or internal vertical mode, so you need not worry that such things 
might change the mode or affect a printed document. A control space (\ u ) will, 
however, be regarded as the beginning of a paragraph; the paragraph will start 
with a blank space after the indentation. 

At the end of a T^X manuscript it’s usually best to finish everything 
off by typing ‘\bye’, which is plain TgX’s abbreviation for ‘\vf ill\eject\end’. 
The ‘\vf ill’ gets T^X into vertical mode and inserts enough space to fill up the 
last page; ‘\eject’ outputs that last page; and ‘\end’ sends the computer into 
its endgame routine. 

f lj;X gets into internal vertical mode when you ask it to construct something 
from a vertical list of boxes (using \vbox or \vtop or \vcenter or \valign 
or \vadjust or \insert). It gets into restricted horizontal mode when you ask it to 
construct something from a horizontal list of boxes (using \hbox or \halign). Box 
construction is discussed in Chapters 12 and 21. We will see later that there is very 
little difference between internal vertical mode and ordinary vertical mode, and very 
little difference between restricted horizontal mode and ordinary horizontal mode; but 
they aren’t quite identical, because they have different goals. 

f Whenever TgX looks at a token of input to decide what should be done next, 
the current mode has a potential influence on what that token means. For 
example, \kern specifies vertical spacing in vertical mode, but it specifies horizontal 
spacing in horizontal mode; a math shift character like “$’ causes entry to math mode 
from horizontal mode, but it causes exit from math mode when it occurs in math mode; 
two consecutive math shifts ($$) appearing in horizontal mode will initiate display math 
mode, but in restricted horizontal mode they simply denote an empty math formula. 
TgX uses the fact that some operations are inappropriate in certain modes to help you 
recover from errors that might have crept into your manuscript. Chapters 24 to 26 
explain exactly what happens to every possible token in every possible mode. 

f lj;X often interrupts its work in one mode to do some task in another mode, 
after which the original mode is resumed again. For example, you can say 
‘\hbox{’ in any mode; when T^X digests this, it suspends whatever else it was doing 
and enters restricted horizontal mode. The matching *}’ will eventually cause the hbox 
to be completed, whereupon the postponed task will be taken up anew. In this sense 
TTX can be in many modes simultaneously, but only the innermost mode influences the 
calculations at any time; the other modes have been pushed out of T^rX’s consciousness. 
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One way to become familiar with TpX's modes is to consider the following 
curious test file called modes.tex, which exercises all the modes at once: 

1 \tracingcommands=l 

2 \hbox{ 

3 $ 

4 \vbox{ 

3 \noindent$$ 

6 x\showlists 

7 $$}$}\bye 

The first line of modes.tex tells l{gX to log every command it receives; TjgX will produce 
diagnostic data whenever \tracingcommands is positive. Indeed, if you run TpX on 
modes.tex you will get a modes.log file that includes the following information: 

{vertical mode: \hbox} 

{restricted horizontal mode: blank space} 

{math shift character $} 

{math mode : blank space} 

{\vbox} 

{internal vertical mode: blank space} 

{\noindent} 

{horizontal mode : math shift character $} 

{display math mode: blank space} 

{the letter x} 

The meaning is that T{gX first saw an \hbox token in vertical mode; this caused it to 
go ahead and read the ‘{’ behind the scenes. Then T}gX entered restricted horizontal 
mode, and saw the blank space token that resulted from the end of line 2 in the 
file. Then it saw a math shift character token (still in restricted horizontal mode), 
which caused a shift to math mode; another blank space came through. Then \vbox 
inaugurated internal vertical mode, and \noindent instituted horizontal mode within 
that; two subsequent $ signs led to display math mode. (Only the first $ was shown 
by \tracingcommands, because that one caused T{gX to look ahead for another.) 

The next thing in modes.log after the output above is ‘{\showlists}’. This 
is another handy diagnostic command that you can use to find out things that 
TJgX ordinarily keeps to itself; it causes TfrX to display the lists that are being worked 
on, in the current mode and in all enclosing modes where the work has been suspended: 

### display math mode entered at line 5 

\mathord 

.\faml x 

### internal vertical mode entered at line 4 

prevdepth ignored 

### math mode entered at line 3 

### restricted horizontal mode entered at line 2 
\glue 3.33333 plus 1.66666 minus 1.11111 
spacefactor 1000 

### vertical mode entered at line 0 
prevdepth ignored 
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In this case the lists represent five levels of activity, all present at the end of line 6 of 
modes.tex. The current mode is shown first, namely, display math mode, which began 
on line 5. The current math list contains one “mathord” object, consisting of the 
letter x in family 1. (Have patience and you will understand what that means, when 
you learn about TjgX’s math formulas.) Outside of display math mode comes internal 
vertical mode, to which Tf.'X will return when the paragraph containing the displayed 
formula is complete. The vertical list on that level is empty; ‘prevdepth ignored’ 
means that \prevdepth has a value < — 1000 pt, so that the next interline glue will be 
omitted (cf. Chapter 12). The math mode outside of this internal vertical mode has 
an empty list, likewise, but the restricted horizontal mode enclosing the math mode 
contains some glue. Finally, we see the main vertical mode that encloses everything; 
this mode was ‘entered at line O’, i.e., before the file modes.tex was input; nothing 
has been contributed so far to the vertical list on this outermost level. 


► EXERCISE 13.2 

Why is there glue in one of these lists but not in the others? 

► EXERCISE 13.3 

After this output of \showlists, the modes.log file contains further output 
from \tracingcommands. In fact, the next two lines of that file are 

{math shift character $} 

{horizontal mode : end-group character }} 



because the “$$’ on line 7 finishes the displayed formula, and this resumes horizontal 
mode for the paragraph that was interrupted. What do you think are the next three 
lines of modes . log ? 


/£> ►EXERCISE 13.4 

JL Suppose TgX has generated a document without ever leaving vertical mode. 
What can you say about that document? 


EXERCISE 13.5 

JL JL Some of IjrX’s modes cannot immediately enclose other modes; for example, 
display math mode is never directly enclosed by horizontal mode, even though displays 
occur within paragraphs, because an interrupted paragraph-so-far of horizontal mode is 
always completed and removed from TgX’s memory before the processing of a displayed 
formula begins. Give a complete characterization of all pairs of consecutive modes that 
can occur in the output of \showlists. 
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Every mode of life has its conveniences. 
— SAMUEL JOHNSON, The Idler (1758) 

[Hindu musicians] have eighty-four modes, 
of which thirty-six are in general use, 
and each of which, it appears, has a peculiar expression, 
and the power of moving some particular sentiment or affection. 
— MOUNTSTUART ELPHINSTONE, History of India (1841) 
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One of a typesetting system’s chief duties is to take a long sequence of words 
and to break it up into individual lines of the appropriate size. For example, 
every paragraph of this manual has been broken into lines that are 29 picas wide, 
but the author didn’t have to worry about such details when he composed the 
manuscript. TgX chooses breakpoints in an interesting way that considers each 
paragraph in its entirety; the closing words of a paragraph can actually influence 
the appearance of the first line. As a result, the spacing between words is as 
uniform as possible, and the computer is able to reduce the number of times that 
words must be hyphenated or formulas must be split between lines. 

The experiments of Chapter 6 have already illustrated the general ideas: 
We discussed the notion of “badness,” and we ran into “overfull” and “underfull” 
boxes in difficult situations. We also observed that different settings of T^X’s 
\tolerance parameter will produce different effects; a higher tolerance means 
that wider spaces are acceptable. 

Tf^X will find the absolutely best way to typeset any given paragraph, 
according to its ideas of minimum badness. But such “badness” doesn’t account 
for everything, and if you rely entirely on an automatic scheme you will occasion- 
ally encounter line breaks that are not really the best on psychological grounds; 
this is inevitable, because computers don’t understand things the way people do 
(at least not yet). Therefore you’ll sometimes want to tell the machine that 
certain places are not good breakpoints. Conversely, you will sometimes want 
to force a break at a particular spot. TgX provides a convenient way to avoid 
psychologically bad breaks, so that you will be able to obtain results of the finest 
quality by simply giving a few hints to the machine. 

“Ties” — denoted by ‘"’ in plain TgX — are the key to successful line 
breaking. Once you learn how to insert them, you will have graduated from 
the ranks of ordinary T^Xmcnl typists to the select group of Distinguished 
T^Xmcians. And it’s really not difficult to train yourself to insert occasional 
ties, almost without thinking, as you type a manuscript. 

When you type ~ it’s the same as typing a space, except that TgX won’t 
break a line at this space. Furthermore, you shouldn’t leave any blanks next to 
the ~, since they will count as additional spaces. If you put ~ at the very end 
of a line in your input file, you’ll get a wider space than you want, because the 
(return) that follows the ~ produces an extra space. 

We have already observed in Chapter 12 that it’s generally a good idea 
to type ~ after an abbreviation that does not come at the end of a sentence. Ties 
also belong in several other places: 

■ In references to named parts of a document: 

Chapter" 12 Theorem" 1.2 

Appendix~A Table~\hbox{B-8} 

Figure~3 Lemmas 5 and~6 

(No ~ appears after ‘Lemmas’ in the final example, since there’s no harm in having 
‘5 and 6’ at the beginning of a line. The use of \hbox is explained below.) 


H&J, see hyphenation, line break 

justification, see setting glue, line 

quad left, see flush left 

quad right, see flush right 

quad middle, see :break 
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auxiliary space, see tie 
tilde 

line breaks, avoiding 
breaks, avoiding bad 
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• Between a person’s forenames and between multiple surnames: Knuth 

Trabb Pardo 

Donald~E. Knuth Luis~I. Trabb~Pardo van der Waerden 

Charles XII 

Bartel Leendert van der Waerden Charles XII Vallee Poussin 

enumerated cases within a paragi 

Note that it is sometimes better to hyphenate a name than to break it be- 
tween words; e.g., ‘Don-’ and ‘aid E. Knuth’ is more tolerable than ‘Donald’ 
and ‘E. Knuth’. The previous rule can be regarded as a special case of this one, 
since we may think of ‘Chapter 12’ as a compound name; another example is 
‘register~X’. Sometimes a name is so long that we dare not tie it all together, 
lest there be no way to break the line: 

Charles Louis Xavier" Joseph de~la Vall\’ee~Poussin. 

■ Between math symbols in apposition with nouns: 

dimension~$d$ width~$w$ function~$f (x) $ 

string~$s$ of length~$l$ 

However, the last example should be compared with 

string~$s$ of length $l$~or more. 

■ Between symbols in series: 

1,~2, or~3 
$a$,~$b$, and~$c$. 

1,~2, \dots,~$n$. 

■ When a symbol is a tightly bound object of a preposition: 

of ~$x$ 
from 0 to~l 
increase $z$ by~l 
in common with~$m$. 

The rule does not, however, apply to compound objects: 
of $u$~and~$v$. 

■ When mathematical phrases are rendered in words: 

equals~$n$ less than~$\epsilon$ (given~$X$) 

mod~2 modulo~$p~e$ for all large~$n$ 

Compare ‘is~15’ with ‘is 15~times the height’. 

■ When cases are being enumerated within a paragraph: 

(b)~Show that $f(x)$ is (1) "continuous ; (2) "bounded. 

It would be nice to boil all of these rules down to one or two simple principles, 
and it would be even nicer if the rules could be automated so that keyboarding 
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could be done without them; but subtle semantic considerations seem to be 
involved. Therefore it’s best to use your own judgment with respect to ties. The 
computer needs your help. 

A tie keeps Tj^X from breaking at a space, but sometimes you want to 
prevent the machine from breaking at a hyphen or a dash. This can be done 
by using \hbox, because TgX will not split up the contents of a box; boxes are 
indecomposable units, once they have been constructed. We have already il- 
lustrated this principle in the ‘Table~\hbox{B-8}’ example considered earlier. 
Another example occurs when you are typing the page numbers in a bibliographic 
reference: It doesn’t look good to put ‘22.’ on a line by itself, so you can type 
‘\hbox{13 — 22} . ’ to prohibit breaking ‘13-22.’ On the other hand, TgX doesn’t 
often choose line breaks at hyphens, so you needn’t bother to insert \hbox com- 
mands unless you need to correct a bad break that TgX has already made on a 
previous run. 

► EXERCISE 14.1 

Here are some phrases culled from previous chapters of this manual. How do 
you think the author typed them? 

(cf. Chapter 12). 

Chapters 12 and 21. 

line 16 of Chapter 6’s story 

lines 7 to 11 

lines 2, 3, 4, and 5. 

(2) a big black bar 

All 256 characters are initially of category 12, 

letter x in family 1. 

the factor /, where n is 1000 times /. 


► EXERCISE 14.2 

How would you type the phrase ‘for all n greater than ii 0 ’ ? 

► EXERCISE 14.3 

And how would you type ‘exercise 4.3.2-15’? 


► EXERCISE 14.4 

Why is it better to type ‘Chapter" 12’ than to type ‘\hbox{Chapter 12}’? 


/■£> ►EXERCISE 14.5 

JL TjrX will sometimes break a math formula after an equals sign. How can you 
stop the computer from breaking the formula ‘m = 0’ ? 


^►EXERCISE 14.6 

Explain how you could instruct TjjX not to make any breaks after explicit 
hyphens and dashes. (This is useful in lengthy bibliographies.) 


Sometimes you want to permit a line break after a ‘/’ just as if it were 
a hyphen. For this purpose plain TgX allows you to say ‘\slash’; for example, 
‘input\slash output’ produces ‘input/output’ with an optional break. 


hyphen 

dash 

hbox 

bibliographic reference 

bibliographies 

slash 



94 Chapter 14-' How Tppi Breaks Paragraphs into Lines 


If you want to force T^X to break between lines at a certain point in the 
middle of a paragraph, just say ‘\break’. However, that might cause the line to 
be really spaced out. 

If you want TgX to fill up the right-hand part of a line with blank space just 
before a forced line break, 

without indenting the next line, say ‘\hf il\break’. 

f You may have several consecutive lines of input for which you want the output 
to appear line-for-line in the same way. One solution is to type ‘\par’ at the 
end of each input line; but that’s somewhat of a nuisance, so plain TfrX provides the 
abbreviation ‘\obeylines’, which causes each end-of-line in the input to be like \par. 
After you say \obeylines you will get one line of output per line of input, unless an 
input line ends with or unless it is so long that it must be broken. For example, you 
probably want to use \obeylines if you are typesetting a poem. Be sure to enclose 
\obeylines in a group, unless you want this “poetry mode” to continue to the end of 
your document. 


{\obeylines\smallskip 
Roses are red, 

\quad Violets are blue; 

Rhymes can be typeset 
\quad With boxes and glue. 

\smallskip} 

► EXERCISE 14.7 

Explain the uses of \quad in this poem. What would have happened if ‘\quad’ 
had been replaced by ‘\indent’ in both places? 



Roughly speaking, Tj.'X breaks paragraphs into lines in the following 
way: Breakpoints are inserted between words or after hyphens so as to produce 
lines whose badnesses do not exceed the current \tolerance. If there’s no way 
to insert such breakpoints, an overfull box is set. Otherwise the breakpoints are 
chosen so that the paragraph is mathematically optimal, i.e., best possible, in 
the sense that it has no more “demerits” than you could obtain by any other 
sequence of breakpoints. Demerits are based on the badnesses of individual lines 
and on the existence of such things as consecutive lines that end with hyphens, 
or tight lines that occur next to loose ones. 

f But the informal description of line breaking in the previous paragraph is 
an oversimplification of what really happens. The remainder of this chapter 
explains the details precisely, for people who want to apply TRX in nonstandard ways. 
IRX’s line-breaking algorithm has proved to be general enough to handle a surprising 
variety of different applications; this, in fact, is probably the most interesting aspect 
of the whole TRX system. However, every paragraph from now on until the end of the 
chapter is prefaced by at least one dangerous bend sign, so you may want to learn the 
following material in easy stages instead of all at once. 


f Before the lines have been broken, a paragraph inside of TRX is actually a 
horizontal list , i.e., a sequence of items that TyX has gathered while in hori- 
zontal mode. We have been saying informally that a horizontal list consists of boxes 


line breaks, forcing 

breaks, forcing good 

break 

underfull 
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poem 

quad 

indent 
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demerits 
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and glue; the truth is that boxes and glue aren’t the whole story. Each item in a 
horizontal list is one of the following types of things: 

■ a box (a character or ligature or rule or hbox or vbox); 

■ a discretionary break (to be explained momentarily); 

■ a “whatsit” (something special to be explained later); 

■ vertical material (from \mark or \vadjust or \insert); 

■ a glob of glue (or \leaders, as we will see later); 

■ a kern (something like glue that doesn’t stretch or shrink); 

■ a penalty (representing the undesirability of breaking here); 

■ “math-on” (beginning a formula) or “nratli-off” (ending a formula). 

The last four types (glue, kern, penalty, and math items) are called discardable, since 
they may change or disappear at a line break; the first four types are called non- 
discardable, since they always remain intact. Many of the things that can appear in 
horizontal lists have not been touched on yet in this manual, but it isn’t necessary to 
understand them in order to understand line breaking. Sooner or later you’ll learn how 
each of the gismos listed above can infiltrate a horizontal list; and if you want to get 
a thorough understanding of 1)jX’s internal processes, you can always use \showlists 
with various features of the language, in order to see exactly what 1)?X is doing. 

f A discretionary break consists of three sequences of characters called the pre- 
break, post-break, and no-break texts. The idea is that if a line break occurs 
here, the pre-break text will appear at the end of the current line and the post-break 
text will occur at the beginning of the next line; but if no break occurs, the no-break 
text will appear in the current line. Users can specify discretionary breaks in complete 
generality by writing 

\discretionary{ (pre-break text)H(post-break text) }{ (no-break text)} 


where the three texts consist entirely of characters, boxes, and kerns. For example, TjgX 
can hyphenate the word ‘difficult’ between the f’s, even though this requires breaking 
the ‘ffi’ ligature into ‘f-’ followed by an ‘fi’ ligature, if the horizontal list contains 

di\discretionary{f-}{f i}{ff i}cult . 


Fortunately you need not type such a mess yourself; TgX’s hyphenation algorithm 
works behind the scenes, taking ligatures apart and putting them into discretionary 
breaks when necessary. 



The most common case of a discretion ary break is a simple discretionary 
hyphen 


\discretionary{-}{}{} 

for which TgX accepts the abbreviation The next most common case is 
\discretionary{}{}{} 

(an “empty discretionary”), which automatically inserts after and after every 
ligature that ends with In the case of plain TJeX, empty discretionaries are therefore 
inserted after hyphens and dashes. (Each font has an associated \hyphenchar, which 
we can assume for simplicity is equal to 
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f When TgX hyphenates words, it simply inserts discretionary breaks into the 
horizontal list. For example, the words ‘discretionary hyphens’ are trans- 
formed into the equivalent of 

dis\-cre\-tionary hy\-phens 

if hyphenation becomes necessary. But Tj^X doesn’t apply its hyphenation algorithm 
to any word that already contains a discretionary break; therefore you can use explicit 
discretionaries to override TpX’s automatic method, in an emergency. 

►EXERCISE 14.8 

JL Some compound words in German text change their spelling when they are 
split between lines. For example, ‘baeken’ becomes ‘bak-ken’ and ‘Bettuch’ becomes 
‘Bett-tuch’. How can you instruct TpX to produce this effect? 

f in order to save time, Tj?X tries first to break a paragraph into lines without 
inserting any discretionary hyphens. This first pass will succeed if a sequence 
of breakpoints is found for which none of the resulting lines has a badness exceeding 
the current value of \pretolerance. If the first pass fails, the method of Appendix H 
is used to hyphenate each word of the paragraph by inserting discretionary breaks 
into the horizontal list, and a second attempt is made using \tolerance instead of 
\pretolerance. When the lines are fairly wide, as they are in this manual, experiments 
show that the first pass succeeds more than 90% of the time, and that fewer than 2 words 
per paragraph need to be subjected to the hyphenation algorithm, on the average. 
But when the lines are very narrow the first pass usually fails rather quickly. Plain 
TjgX sets \pretolerance=100 and \tolerance=200 as the default values. If you make 
\pretolerance=10000, the first pass will essentially always succeed, so hyphenations 
will not be tried (and the spacing may be terrible); on the other hand if you make 
\pretolerance=-l, Tj?X will omit the first pass and will try to hyphenate immediately. 

f Line breaks can occur only in certain places within a horizontal list. Roughly 
speaking, they occur between words and after hyphens, but in actuality they 
are permitted in the following five cases: 

a) at glue, provided that this glue is immediately preceded by a non-discardable 
item, and that it is not part of a math formula (i.e., not between math-on and 
math-off). A break “at glue” occurs at the left edge of the glue space, 

b) at a kern, provided that this kern is immediately followed by glue, and that it 
is not part of a math formula, 

c) at a math-off that is immediately followed by glue. 

d) at a penalty (which might have been inserted automatically in a formula), 

e) at a discretionary break. 

Notice that if two globs of glue occur next to each other, the second one will never be 
selected as a breakpoint, since it is preceded by glue (which is discardable). 

f Each potential breakpoint has an associated “penalty,” which represents the 
“aesthetic cost” of breaking at that place. In cases (a), (b), (c), the penalty is 
zero; in case (d) an explicit penalty has been specified; and in case (e) the penalty is the 
current value of \hyphenpenalty if the pre-break text is nonempty, or the current value 
of \exhyphenpenalty if the pre-break text is empty. Plain TjrX sets \hyphenpenalty=50 
and \exhyphenpenalty=50. 
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f For example, if you say ‘\penalty 100’ at some point in a paragraph, that 
position will be a legitimate place to break between lines, but a penalty of 100 
will be charged. If you say ‘\penalty-100’ you are telling Tj?X that this is a rather 
good place to break, because a negative penalty is really a “bonus”; a line that ends 
with a bonus might even have “merits” (negative demerits). 

f Any penalty that is 10000 or more is considered to be so large that TjgX will 
never break there. At the other extreme, any penalty that is —10000 or less 
is considered to be so small that TgX will always break there. The \nobreak macro of 
plain TgX is simply an abbreviation for XpenaltylOOOO’, because this prohibits a line 
break. A tie in plain TjjX is equivalent to ‘\nobreak\ u ’; there will be no break at the 
glue represented by \ u in this case, because glue is never a legal breakpoint when it is 
preceded by a discardable item like a penalty. 

►EXERCISE 14.9 

JL Guess how the \break macro is defined in plain IjjX. 

/ 5 X ►EXERCISE 14.10 

JL What happens if you say \nobreak\break or \break\ no break? 

f Wlien a line break actually does occur, TgX removes all discardable items that 
follow the break, until coming to something non-discardable, or until coming 
to another chosen breakpoint. For example, a sequence of glue and penalty items will 
vanish as a unit, if no boxes intervene, unless the optimum breakpoint sequence includes 
one or more of the penalties. Math-on and math-off items act essentially as kerns that 
contribute the spacing specified by \mathsurround; such spacing will disappear into the 
line break if a formula conies at the very end or the very beginning of a line, because 
of the way the rules have been formulated above. 

/gb/gb The badness of a line is an integer that is approximately 100 times the cube 
JL JL of the ratio by which the glue inside the line must stretch or shrink to make 
an libox of the required size. For example, if the line has a total shrinkability of 
10 points, and if the glue is being compressed by a total of 9 points, the badness is 
computed to be 73 (since 100 x (9/10) 3 = 72.9); similarly, a line that stretches by 
twice its total stretchability has a badness of 800. But if the badness obtained by 
this method turns out to be more than 10000, the value 10000 is used. (See the 
discussion of “glue set ratio” r and “glue set order” i in Chapter 12; if i yl 0, there is 
infinite stretchability or shrinkability, so the badness is zero, otherwise the badness is 
approximately min(100r 3 , 10000).) Overfull boxes are considered to be infinitely bad; 
they are avoided whenever possible. 

/gb/gb A line whose badness is 13 or more has a glue set ratio exceeding 50%. We 
JL JL call such a line tight if its glue had to shrink, loose if its glue had to stretch, 
and very loose if it had to stretch so much that the badness is 100 or more. But if the 
badness is 12 or less we say that the line is decent. Two adjacent lines are said to be 
visually incompatible if their classifications are not adjacent, i.e., if a tight line is next 
to a loose or very loose line, or if a decent line is next to a very loose one. 

/gb/gb Te% rates each potential sequence of breakpoints by totalling up demerits that 
JL JL are assessed to individual lines. The goal is to choose breakpoints that yield 
the fewest total demerits. Suppose that a line has badness b, and suppose that the 
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penalty p is associated with the breakpoint at the end of this line. As stated above, 
TjgX will not even consider such a line if p > 10000, or if b exceeds the current tolerance 
or pretolerance. Otherwise the demerits of such a line are defined by the formula 

({l + b) 2 +p 2 , if 0 < p < 10000; 
d = < (l + b) 2 - p 2 , if -10000 < p < 0; 

1(1 -ft) 2 , if p< -10000. 

Here l is the current value of \linepenalty, a parameter that can be increased if you 
want TgX to try harder to keep all paragraphs to the minimum number of lines; plain 
TjgX sets \linepenalty=10. For example, a line with badness 20 ending at glue will 
have (10 + 20) 2 = 900 demerits, if l = 10, since there’s no penalty for a break at glue. 
Minimizing the total demerits of a paragraph is roughly the same as minimizing the sum 
of the squares of the badnesses and penalties; this usually means that the maximum 
badness of any individual line is also minimized, over all sequences of breakpoints. 

&><&)*■ EXERCISE 14.11 

JL JL The formula for demerits has a strange discontinuity: It seems more reasonable 
at first to define d = {l + b) 2 — 10000 2 , in the case p < —10000. Can you account for 
this apparent discrepancy? 

Additional demerits are assessed based on pairs of adjacent lines. If two con- 
JL JL secutive lines are visually incompatible, in the sense explained a minute ago, 
the current value of \adj demerits is added to d. If two consecutive lines end with dis- 
cretionary breaks, the \doublehyphendemerits are added. And if the second-last line of 
the entire paragraph ends with a discretionary, the \f inalhyphendemerits are added. 
Plain TjgX sets up the values \adjdemerits=10000, \doublehyphendemerits=10000, 
and \f inalhyphendemerits=5000. Demerits are in units of “badness squared,” so the 
demerit-oriented parameters need to be rather large if they are to have much effect; 
but tolerances and penalties are given in the same units as badness. 

/>)/>) If you set \tracingparagraphs=l, your log file will contain a summary of 
JL JL TfgX’s line-breaking calculations, so you can watch the tradeoffs that occur 
when parameters like \linepenalty and \hyphenpenalty and \adj demerits are twid- 
dled. The line-break data looks pretty scary at first, but you can learn to read it with a 
little practice; this, in fact, is the best way to get a solid understanding of line breaking. 
Here is the trace that results from the second paragraph of the story file in Chapter 6, 
when \hsize=2.5in and \tolerance=1000: 


[]\tenrm Mr. Drofnats or ‘‘E. J.,’’ as he pre- 

@\discretionary via @00 b=0 p=50 d=2600 
@01: line 1.2- t=2600 -> <§<§0 

ferred to be called was hap-pi-est when 

0 via b=131 p=0 d=29881 
<§<§2 : line 2.0 t=32481 -> <§@1 
he 

® via b=25 p=0 d=1225 

@@3: line 2.3 t=3825 -> @01 

was at work type-set-ting beau-ti-ful doc- 

@\discretionary via @@2 b=l p=50 d=12621 

@\discretionary via @@3 b=291 p=50 d=103101 


linepenalty 
adjdemerits 
doublehyphendemerits 
finalhyphendemerits 
I racing paragraphs 
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<§<§4 : line 3.2- t=45102 -> @02 
u- 

@\discretionary via <§<§3 b=44 p=50 d=15416 
@05: line 3.1- t=19241 -> @03 
ments . 

@\par via @04 b=0 p=-10000 d=5100 
@\par via @05 b=0 p=-10000 d=5100 
@06 : line 4.2- t=24341 -> @05 


Lines that begin with ‘00’ represent feasible breakpoints, i.e., breakpoints that can 
be reached without any badness exceeding the tolerance. Feasible breakpoints are 
numbered consecutively, starting with @01; the beginning of the paragraph is considered 
to be feasible too, and it is number @00. Lines that begin with ‘0’ but not ‘00’ are 
candidate ways to reach the feasible breakpoint that follows; TpX will select only the 
best candidate, when there is a choice. Lines that do not begin with ‘0’ indicate how 
far TjgX has gotten in the paragraph. Thus, for example, we find ‘@@2: line 2.0 
t=32481 -> 001’ after . .hap-pi-est when’ and before ‘he’, so we know that feasible 
breakpoint @02 occurs at the space between the words when and he. The notation Tine 
2.0’ means that this feasible break comes at the end of line 2, and that this line will 
be very loose. (The suffixes .0, .1, .2, .3 stand respectively for very loose, loose, 
decent, and tight.) A hyphen is suffixed to the line number if that line ends with a 
discretionary break, or if it is the final line of the paragraph; for example, Tine 1.2-’ 
is a decent line that was hyphenated. The notation ‘t=32481’ means that the total 
demerits from the beginning of the paragraph to @02 are 32481, and ‘-> 001’ means 
that the best way to get to 002 is to come from 001. On the preceding line of trace data 
we see the calculations for a typeset line to this point from 001: the badness is 131, 
the penalty is 0, hence there are 29881 demerits. Similarly, breakpoint 003 presents 
an alternative for the second line of the paragraph, obtained by breaking between ‘he’ 
and ‘was’; this one makes the second line tight, and it has only 3825 demerits when 
the demerits of line 1 are added, so it appears that 003 will work much better than 
@02. However, the next feasible breakpoint (004) occurs after ‘doc-’, and the line from 
@02 to @04 has only 12621 demerits, while the line from @03 to @04 has a whopping 
103101; therefore the best way to get from @00 to @04 is via @02. If we regard demerits 
as distances, TjrX is finding the “shortest paths” from @00 to each feasible breakpoint 
(using a variant of a well-known algorithm for shortest paths in an acyclic graph). 
Finally the end of the paragraph comes at breakpoint @06, and the shortest path from 
@00 to @06 represents the best sequence of breakpoints. Following the arrows back 
from @06, we deduce that the best breaks in this particular paragraph go through @05, 
@03, and @01. 

EXERCISE 14.12 

JL JL Explain why there are 29881 demerits from @01 to @02, and 12621 demerits 
from @02 to @04. 



If ‘b=*’ appears in such trace data, it means that an infeasible breakpoint had 
to be chosen because there was no feasible way to keep total demerits small. 



points. 


We still haven’t discussed the special trick that allows the final line of a para- 
graph to be shorter than the others. Just before TgX begins to choose break- 
it. does two important things: (1) If the final item of the current horizontal 
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list is glue, that glue is discarded. (The reason is that a blank space often gets into a 
token list just before \par or just before $$, and this blank space should not be part 
of the paragraph.) (2) Three more items are put at the end of the current horizontal 
list: \penalty 10000 (which prohibits a line break); \hskip\parf illskip (which adds 
“finishing glue” to the paragraph); and \penalty-10000 (which forces the final break). 
Plain Tp;X sets \parf illskip=0pt pluslf il, so that the last line of each paragraph will 
be filled with white space if necessary; but other settings of \parf illskip are appro- 
priate in special applications. For example, the present paragraph ends flush with the 
right margin, because it was typeset with \parf illskip=0pt; the author didn’t have to 
rewrite any of the text in order to make this possible, since a long paragraph generally 
allows so much flexibility that a line break can be forced at almost any point. You 
can have some fun playing with paragraphs, because the algorithm for line breaking 
occasionally appears to be clairvoyant. Just write paragraphs that are long enough. 


/£> ►EXERCISE 14.13 

JL Ben User decided to say ‘\hf ilnegXpar’ at the end of a paragraph, intending 
that the negative stretchability of \hfilneg would cancel with the \parf illskip of 
plain TjgX. Why didn’t his bright idea work? 

►EXERCISE 14.14 

JL How can you set \parf illskip so that the last line of a paragraph has exactly 
as much white space at the right as the first line has indentation at the left? 

<§)&)*■ EXERCISE 14.15 

JL JL Since TgX reads an entire paragraph before it makes any decisions about 
line breaks, the computer’s memory capacity might be exceeded if you are typesetting 
the works of some philosopher or modernistic novelist who writes 200-line paragraphs. 
Suggest a way to cope with such authors. 


TjjX has two parameters called \leftskip and \rightskip that specify glue 
to be inserted at the left and right of every line in a paragraph; this glue is 
taken into account when badnesses and demerits are computed. Plain TJtX normally 
keeps \leftskip and \rightskip zero, but it has a ‘Xnarrower’ macro that increases 
both of their values by the current \parindent. You may want to use Xnarrower when 
quoting lengthy passages from a book. 

{\narrower\smallskip\no indent 
This paragraph will have narrower lines than 
the surrounding paragraphs do , because it 
uses the ‘ ‘narrower’ ’ feature of plain \TeX. 

The former margins will be restored after 
this group ends . \smallskip} 



(Try it.) The second ‘\smallskip’ in this example ends the paragraph. It’s important 
to end the paragraph before ending the group, for otherwise the effect of \narrower 
will disappear before Tj^X begins to choose line breaks. 

►EXERCISE 14.16 

JL When an entire paragraph is typeset in italic or slanted type, it sometimes 
appears to be offset on the page with respect to other paragraphs. Explain how you 
could use \leftskip and \rightskip to shift all lines of a paragraph left by lpt. 
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/£> ►EXERCISE 14.17 

JL The \centerline, \leftline, \rightline, and Mine macros of plain TjgX 
don’t take \leftskip and \rightskip into account. How could you make them do so? 


^>> If you suspect that \raggedright setting is accomplished by some appropriate 
manipulation of \rightskip, you are correct. But some care is necessary. For 
example, a person can set \rightskip=0pt pluslfil, and every line will be filled 
with space at the right. But this isn’t a particularly good way to make ragged-right 
margins, because the infinite stretchability will assign zero badness to lines that are 
very short. To do a decent job of ragged-right setting, the trick is to set \rightskip 
so that it will stretch enough to make line breaks possible, yet not too much, because 
short lines should be considered bad. Furthermore the spaces between words should 
be fixed so that they do not stretch or shrink. (See the definition of \raggedright in 
Appendix B.) It would also be possible to allow a little variability in the interword 
glue, so that the right margin would not be quite so ragged but the paragraphs would 
still have an informal appearance. 


f lj;X looks at the parameters that affect line breaking only when it is breaking 
lines. For example, you shouldn’t try to change the \hyphenpenalty in the 
middle of a paragraph, if you want TjjX to penalize the hyphens in one word more than 
it does in another word. The relevant values of \hyphenpenalty, \rightskip, \hsize, 
and so on, are the ones that are current at the end of the paragraph. On the other 
hand, the width of indentation that you get implicitly at the beginning of a paragraph 
or when you say ‘\indent’ is determined by the value of \parindent at the time the 
indentation is contributed to the current horizontal list, not by its value at the end 
of the paragraph. Similarly, penalties that are inserted into math formulas within a 
paragraph are based on the values of \binoppenalty and \relpenalty that are current 
at the end of each particular formula. Appendix D contains an example that shows how 
to have both ragged-right and ragged-left margins within a single paragraph, without 
using \leftskip or \rightskip. 


the other 
In additio 


f it’s possible to control the length of lines in a much more general way, if 
simple changes to \leftskip and \rightskip aren’t flexible enough for your 
purposes. For example, a semicircular hole has been cut out of the present 
paragraph, in order to make room for a circular illustration that con- The area , 

tains some of Galileo’s immortal words about circles; all of the line 
breaks in this paragraph and in the circular quotation were found 
by TjjX’s line-breaking algorithm. You can specify an essentially 
arbitrary paragraph shape by saying \parshape=(number), where 
the (number) is a positive integer n, followed by 2 n (dimen) spec- 
ifications. In general, ‘\parshape=n ii h is h ■ ■ ■ in In' specifies 
a paragraph whose first n lines will have lengths h, h, ■ ■ . , l n , 
respectively, and they will be indented from the left margin by the 
respective amounts i \ , is,, . . . , %. If the paragraph has fewer than 
n lines, the additional specifications will be ignored; if it has more 
than n lines, the specifications for line n will be repeated ad infinitum. 

You can cancel the effect of a previously specified \parshape by saying ‘\parshape 
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= 0 ’. 


^►EXERCISE 14.18 

Typeset the following Pascalian quotation in the shape of an isosceles triangle: 
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“I turn, in the following treatises, to various uses of those triangles whose generator 
is unity. But I leave out many more than I include; it is extraordinary how fertile in 
properties this triangle is. Everyone can try his hand.” 

You probably won’t need unusual parsliapes very often. But there’s a special 
case that occurs rather frequently, so T£X provides a special abbreviation 
for it in terms of two parameters called \hangindent and \hangafter. The command 
‘\hangindent={dimen}’ specifies a so-called hanging indentation, and the command 
‘\hangafter={number)’ specifies the duration of that indentation. Let x and n be the 
respective values of \hangindent and \hangafter, and let h be the value of \hsize; 
then if n > 0, hanging indentation will occur on lines n + 1, n + 2, ... of the paragraph, 
but if n < 0 it will occur on lines 1,2,..., |n|. Hanging indentation means that lines will 
be of width h — |^| instead of their normal width h; if x > 0, the lines will be indented 
at the left margin, otherwise they will be indented at the right margin. For example, 
the “dangerous bend” paragraphs of this manual have a hanging indentation of 3 picas 
that lasts for two lines; they were set with \hangindent=3pc and \hangafter=-2. 

Plain TjjX uses hanging indentation in its ‘\item’ macro, which produces a 
paragraph in which every line has the same indentation as a normal \ indent. 
Furthermore, \item takes a parameter that is placed into the position of the indentation 
on the first line. Another macro called ‘\itemitem’ does the same thing but with double 
indentation. For example, suppose you type 

\item{l.} This is the first of several cases that are being 
enumerated, with hanging indentation applied to entire paragraphs. 
\itemitem{a)} This is the first subcase. 

\itemitem{b)} And this is the second subcase. Notice 
that subcases have twice as much hanging indentation. 

\item{2.} The second case is similar. 




Then you get the following output: 

1. This is the first of several cases that are being enumerated, with hanging 
indentation applied to entire paragraphs. 

a) This is the first subcase. 

b) And this is the second subcase. Notice that subcases have twice as 
much hanging indentation. 

2. The second case is similar. 


(Indentations in plain T^X are not actually as dramatic as those displayed here; Appen- 
dix B says ‘\parindent=20pt’, but this manual has been set with \parindent=36pt.) 
It is customary to put \medskip before and after a group of itemized paragraphs, and 
to say \noindent before any closing remarks that apply to all of the cases. Blank lines 
are not needed before \item or \itemitem, since those macros begin with \par. 

►EXERCISE 14.19 

JL Suppose one of the enumerated cases continues for two or more paragraphs. 
How can you use \item to get hanging indentation on the subsequent paragraphs? 
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► EXERCISE 14.20 

Explain how to make a “bulleted” item that says V instead of ‘I.’. 
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/>)/>>► EXERCISE 14.21 

JL JL The ‘\item’ macro doesn’t alter the right-hand margin. How could you indent 
at both sides? 

&)<&)*■ EXERCISE 14.22 

JL JL Explain how you could specify a hanging indentation of —2 ems (i.e., the lines 
should project into the left margin), after the first two lines of a paragraph. 

f lf \parshape and hanging indentation have both been specified, \parshape 
takes precedence and \hangindent is ignored. You get the normal paragraph 
shape, in which every line width is \hsize, when \parshape=0, \hangindent=Opt, and 
\hangafter=l. T^X automatically restores these normal values at the end of every 
paragraph, and (by local definitions) whenever it enters internal vertical mode. For 
example, hanging indentation that might be present outside of a \vbox construction 
won’t occur inside that vbox, unless you ask for it inside. 

EXERCISE 14.23 

JL JL Suppose you want to leave room at the right margin for a rectangular illus- 
tration that takes up 15 lines, and you expect that three paragraphs will go by before 
you have typeset enough text to get past that illustration. Suggest a good way to do 
this without trial and error, given the fact that IjrX resets hanging indentation. 

/gb/gb If displayed equations occur in a paragraph that has a nonstandard shape, TgX 
JL JL always assumes that the display takes up exactly three lines. For example, a 
paragraph that has four lines of text, then a display, then two more lines of text, is 
considered to be 4 + 34-2 = 9 lines long; the displayed equation will be indented and 
centered using the paragraph shape information appropriate to line 6. 

TfcX has an internal integer variable called \prevgraf that records the number 
JL JL of lines in the most recent paragraph that lias been completed or partially 
completed. You can use \prevgraf in the context of a (number), and you can set 
\prevgraf to any desired nonnegative value if you want to make TgX think that it is in 
some particular part of the current paragraph shape. For example, let’s consider again 
a paragraph that contains four lines plus a display plus two more lines. When TjjX 
starts the paragraph, it sets \prevgraf=0; when it starts the display, \prevgraf will 
be 4; when it finishes the display, \prevgraf will be 7; and when it ends the paragraph, 
\prevgraf will be 9. If the display is actually one line taller than usual, you could set 
\prevgraf=8 at the beginning of the two final lines; then will think that a 10-line 
paragraph is being made. The value of \prevgraf affects line breaking only when Tj^X 
is dealing with nonstandard \parshape or \hangindent. 

EXERCISE 14.24 

JL JL Solve exercise 14.23 using \prevgraf . 

/gb/gb You are probably convinced by now that TjjX’s line-breaking algorithm has 
JL JL plenty of bells and whistles, perhaps even too many. But there’s one more 
feature, called “looseness”; some day you might find yourself needing it, when you are 
fine-tuning the pages of a book. If you set \looseness=l, TjrX will try to make the 
current paragraph one line longer than its optimum length, provided that there is a 
way to choose such breakpoints without exceeding the tolerance you have specified for 
the badnesses of individual lines. Similarly, if you set \looseness=2, TjjX will try to 
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make the paragraph two lines longer; and \looseness=-l causes an attempt to make 
it shorter. The general idea is that Tj'X first finds breakpoints as usual; then if the 
optimum breakpoints produce n lines, and if the current \looseness is l , TgX will 
choose the final breakpoints so as to make the final number of lines as close as possible 
to n + l without exceeding the current tolerance. Furthermore, the final breakpoints will 
have fewest total demerits, considering all ways to achieve the same number of lines. 

/>)/>) For example, you can set \looseness=l if you want to avoid a lonely “club 
JL JL line” or “widow line” on some page that does not have sufficiently flexible glue, 
or if you want the total number of lines in some two-column document to come out 
to be an even number. It’s usually best to choose a paragraph that is already pretty 
“full,” i.e., one whose last line doesn’t have much white space, since such paragraphs 
can generally be loosened without much harm. You might also want to insert a tie 
between the last two words of that paragraph, so that the loosened version will not 
end with only one “widow word” on the line; this tie will cover your tracks, so that 
people will find it hard to detect the fact that you have tampered with the spacing. 
On the other hand, TgX can take almost any sufficiently long paragraph and stretch it 
a bit, without substantial harm; the present paragraph is, in fact, one line looser than 
its optimum length. 

TeX resets the looseness to zero at the same time as it resets \hangindent, 
\hangafter, and \parshape. 

[► EXERCISE 14.25 

Explain what Tj?X will do if you set \looseness=-1000. 

Just before switching to horizontal mode to begin scanning a paragraph, TpX 
inserts the glue specified by \parskip into the vertical list that will contain 
the paragraph, unless that vertical list is empty so far. For example, ‘\parskip=3pt’ 
will cause 3 points of extra space to be placed between paragraphs. Plain TjgX sets 
\parskip=0pt pluslpt; this gives a little stretcliability, but no extra space. 

f After line breaking is complete, TjgX appends the lines to the current vertical 
list that encloses the current paragraph, inserting interline glue as explained in 
Chapter 12; this interline glue will depend on the values of \baselineskip, \lineskip, 
and \lineskiplimit that are currently in force. TjjX will also insert penalties into the 
vertical list, just before each glob of interline glue, in order to help control page breaks 
that might have to be made later. For example, a special penalty will be assessed for 
breaking a page between the first two lines of a paragraph, or just before the last line, 
so that “club” or “widow” lines that are detached from the rest of a paragraph will not 
appear all alone on a page unless the alternative is worse. 

f Here’s how interline penalties are calculated: IjrX has just chosen the break- 
points for some paragraph, or for some partial paragraph that precedes a 
displayed equation; and n lines have been formed. The penalty between lines j and 
j + 1, given a value of j in the range 1 < j < n, is the value of \interlinepenalty 
plus additional charges made in special cases: The \clubpenalty is added if j = 1, 
i.e., just after the first line; then the \displaywidowpenalty or the \widowpenalty is 
added if j = n — 1, i.e., just before the last line, depending on whether or not the 
current lines immediately precede a display; and finally the \brokenpenalty is added, 
if the j'th line ended at a discretionary break. (Plain T|'X sets \clubpenalty=150, 
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\widowpenalty=150, \displaywidowpenalty=50, and \brokenpenalty=100; the value 
of \interlinepenalty is normally zero, but it is increased to 100 within footnotes, so 
that long footnotes will tend not to be broken between pages.) 

(p, ►EXERCISE 14.26 

JL Consider a five-line paragraph in which the second and fourth lines end with 
hyphens. What penalties does plain TjgX put between the lines? 

►EXERCISE 14.27 

JL What penalty goes between the lines of a two-line paragraph? 

If you say \vadjust{(vertical list)} within a paragraph, TpX will insert the 
JL JL specified internal vertical list into the vertical list that encloses the paragraph, 
immediately after whatever line contained the position of the \vadjust. For example, 
you can say l \vadjust{\kernlpt}’ to increase the amount of space between lines of a 
paragraph if those lines would otherwise come out too close together. (The author 
did it in the previous line, just to illustrate what happens.) Also, if you want to 
make sure that a page break will occur immediately after a certain line, you can say 
‘\vadjust{\e ject}’ anywhere in that line. 

/>)/>) Later chapters discuss \ insert and \mark commands that are relevant to 
JL JL TjjX’s page builder. If such commands appear within a paragraph, they are 
removed from whatever horizontal lines contain them and placed into the enclosing 
vertical list, together with other vertical material from \vadjust commands that might 
be present. In the final vertical list, each horizontal line of text is an hbox that is 
immediately preceded by interline glue and immediately followed by vertical material 
that has “migrated out” from that line (with left to right order preserved, if there are 
several instances of vertical material); then comes the interline penalty, if it is nonzero. 
Inserted vertical material does not influence the interline glue. 

&)&)*■ EXERCISE 14.28 

JL JL Design a \marginalstar macro that can be used anywhere in a paragraph. It 
should use \vadjust to place an asterisk in the margin just to the left of the line where 
\marginalstar occurs. 

/>)/>) When TjgX enters horizontal mode, it will interrupt its normal scanning to read 
JL JL tokens that were predefined by the command \everypar={ (token list)}. For 
example, suppose you have said ‘\everypar={A} ! . If you type ‘B’ in vertical mode, TpX 
will shift to horizontal mode (after contributing \parskip glue to the current page), 
and a horizontal list will be initiated by inserting an empty box of width \parindent. 
Then will read ‘AB’, since it reads the \everypar tokens before getting back to the 
‘B’ that triggered the new paragraph. Of course, this is not a very useful illustration of 
\everypar; but if you let your imagination run you will think of better applications. 

EXERCISE 14.29 

JL JL Use \everypar to define an \insertbullets macro: All paragraphs in a group 
of the form ‘{\insertbullets . . Apar}’ should have a bullet symbol V as part of 
their indentation. 

/>)/>) A paragraph of zero lines is formed if you say ‘\noindent\par’. If \everypar 
JL JL is null, such a paragraph contributes nothing except \parskip glue to the 
current vertical list. 
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► EXERCISE 14.30 

Guess what happens if you say ‘\noindent$$ . . .$$ 


\par’. 


Experience has shown that l^rX’s line-breaking algorithm can be harnessed 
to a surprising variety of tasks. Here, for example, is an application that 
indicates one of the possibilities: Articles that are published in Mathematical Reviews 
are generally signed with the reviewer’s name and address, and this information is 
typeset flush right, i.e. , at the right-hand margin. If there is sufficient space to put 
such a name and address at the right of the final line of the paragraph, the publishers 
can save space, and at the same time the results look better because there are no 
strange gaps on the page. 


This is a case where the name and address fit in nicely 
with the review. A. Reviewer (Ann Arbor, Mich.) 

But sometimes an extra line must be added. 

N. Bourbaki (Paris) 


Let’s suppose that a space of at least two ems should separate the reviewer’s name 
from the text of the review, if they occur on the same line. We would like to design a 
macro so that the examples shown above could be typed as follows in an input file: 

... with the review, \signed A. Reviewer (Ann Arbor, Mich.) 

... an extra line must be added, \signed W. Bourbaki (Paris) 

Here is one way to solve the problem: 

\def\signed #1 (#2){{\unskip\nobreak\hfil\penalty50 
\hskip2em\hbox{}\nobreak\hf il\sl#l\/ \rm(#2) 

\parf illskip=0pt \f inalhyphendemerits=0 \par}} 


If a line break occurs at the \penalty50, the \hskip2em will disappear and the empty 
\hbox will occur at the beginning of a line, followed by \hf il glue. This yields two lines 
whose badness is zero; the first of these lines is assessed a penalty of 50. But if no line 
break occurs at the \penalty50, there will be glue of 2 em plus 2 fil between the review 
and the name; this yields one line of badness zero. TpX will try both alternatives, 
to see which leads to the fewest total demerits. The one-line solution will usually be 
preferred if it is feasible. 

<£)<£)>■ EXERCISE 14.31 

Explain what would happen if ‘\hbox{}’ were left out of the \signed macro. 
<£><£>► EXERCISE 14.32 

Why does the \signed macro say ‘\f inalhyphendemerits=0’ ? 


EXERCISE 14.33 

JL JL In one of the paragraphs earlier in this chapter, the author used \break to force 
a line break in a specific place; as a result, the third line of that particular paragraph was 
really spaced out. 

Explain why all the extra space went into the third line, instead of being distributed 
impartially among the first three lines. 
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/>)/>) If you want to avoid overfull boxes at all costs without trying to fix them manu- 
JL JL ally, you might be tempted to set tolerance=10000; this allows arbitrarily bad 
lines to be acceptable in tough situations. But infinite tolerance is a bad idea, because 
TJrX doesn’t distinguish between terribly bad and preposterously horrible lines. Indeed, 
a tolerance of 10000 encourages TJrX to concentrate all the badness in one place, making 
one truly unsightly line instead of two moderately bad ones, because a single “write- 
off” produces fewest total demerits according to the rules. There’s a much better way 
to get the desired effect: TgX has a parameter called \emergencystretch that is added 
to the assumed stretchability of every line when badness and demerits are computed, 
in cases where overfull boxes are otherwise unavoidable. If \emergencystretch is posi- 
tive, T^X will make a third pass over a paragraph before choosing the line breaks, when 
the first passes did not find a way to satisfy the \pretolerance and \tolerance. The 
effect of \emergencystretch is to scale down the badnesses so that large infinities are 
distinguishable from smaller ones. By setting \emergencystretch high enough (based 
on \hsize) you can be sure that the \tolerance is never exceeded: hence overfull boxes 
will never occur unless the line-breaking task is truly impossible. 


/>)/>>► EXERCISE 14.34 

JL JL Devise a \raggedcenter macro (analogous to \raggedright) that partitions 
the words of a paragraph into as few as possible lines of approximately equal size and 
centers each individual line. Hyphenation should be avoided if possible. 
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When the author objects to [a hyphenation] 
he should be asked to add or cancel or substitute 
a word or words that will prevent the breakage. 

Authors who insist on even spacing always, 
with sightly divisions always, 
do not clearly understand the rigidity of types. 
— T. L. DE VINNE, Correct Composition (1901) 

In reprinting his own works, whenever [William Morris] 
found a line that justified awkwardly, he altered the wording 
solely for the sake of making it look well in print. 

When a proof has been sent me with two or three 
lines so widely spaced as to make a grey band across the page, 
l have often rewritten the passage so as to fill up the lines better ; 
but I am sorry to say that my object has generally been so little 
understood that the compositor has spoilt all the rest 
of the paragraph instead of mending his former bad work. 
— GEORGE BERNARD SHAW, in The Dolphin (1940) 
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TgX attempts to choose desirable places to divide your document into individual 
pages, and its technique for doing this usually works pretty well. But the problem 
of page make-up is considerably more difficult than the problem of line breaking 
that we considered in the previous chapter, because pages often have much less 
flexibility than lines do. If the vertical glue on a page has little or no ability to 
stretch or to shrink, IpX usually has no choice about where to start a new page; 
conversely, if there is too much variability in the glue, the result will look bad 
because different pages will be too irregular. Therefore if you are fussy about 
the appearance of pages, you can expect to do some rewriting of the manuscript 
until you achieve an appropriate balance, or you might need to fiddle with the 
\looseness as described in Chapter 14; no automated system will be able to do 
this as well as you. 

Mathematical papers that contain a lot of displayed equations have an 
advantage in this regard, because the glue that surrounds a display tends to 
be quite flexible. TpX also gets valuable room to maneuver when you have 
occasion to use \smallskip or \medskip or \bigskip spacing between certain 
paragraphs. For example, consider a page that contains a dozen or so exercises, 
and suppose that there is 3 pt of additional space between exercises, where this 
space can stretch to 4pt or shrink to 2pt. Then there is a chance to squeeze 
an extra line on the page, or to open up the page by removing one line, in 
order to avoid splitting an exercise between pages. Similarly, it is possible to use 
flexible glue in special publications like membership rosters or company telephone 
directories, so that individual entries need not be split between columns or pages, 
yet every column appears to be the same height. 

For ordinary purposes you will probably find that Tj^X’s automatic 
method of page breaking is satisfactory. And when it occasionally gives un- 
pleasant results, you can force the machine to break at your favorite place by 
typing Xeject’. But be careful: \eject will cause TpX to stretch the page 
out, if necessary, so that the top and bottom baselines agree with those on other 
pages. If you want to eject a short page, filling it with blank space at the bottom, 
type ‘\vf ill\eject’ instead. 

f lf you say ‘\eject’ in the middle of a paragraph, the paragraph will end 
first, as if you typed ‘\par\eject’. But Chapter 14 mentions that you can say 
‘\vadjust{\e ject}’ in mid-paragraph, if you want to force a page break after whatever 
line contains your current position when the full paragraph is eventually broken up into 
lines; the rest of the paragraph will go on the following page. 


f To prevent a page break, you can say ‘\nobreak’ in vertical mode, just as 
\nobreak in horizontal mode prevents breaks between lines. For example, it 
is wise to say \nobreak between the title of a subsection and the first line of text in that 
subsection. But \nobreak does not cancel the effect of other commands like \eject 
that tell TfjX to break; it only inhibits a break at glue that immediately follows. You 
should become familiar with Ij^X’s rules for line breaks and page breaks if you want to 
maintain fine control over everything. The remainder of this chapter is devoted to the 
intimate details of page breaking. 
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f TjgX breaks lists of lines into pages by computing badness ratings and penal- 
ties, more or less as it does when breaking paragraphs into lines. But pages 
are made up one at a time and removed from Tj?X’s memory; there is no looking ahead 
to see how one page break will affect the next one. In other words, TjrX uses a special 
method to find the optimum breakpoints for the lines in an entire paragraph, but it 
doesn’t attempt to find the optimum breakpoints for the pages in an entire document. 
The computer doesn’t have enough high-speed memory capacity to remember the con- 
tents of several pages, so TjjX simply chooses each page break as best it can, by a 
process of “local” rather than “global” optimization. 


f Let’s look now at the details of TjjX’s page-making process. Everything you 
contribute to the pages of your document is placed on the main vertical list, 
which is the sequence of items that TjrjX has accumulated while in vertical mode. Each 
item in a vertical list is one of the following types of things: 

■ a box (an libox or vbox or rule); 

■ a “whatsit” (something special to be explained later); 

■ a mark (another thing that will be explained later); 

■ an insertion (yet another thing that we will get to); 

■ a glob of glue (or \leaders, as we will see later); 

■ a kern (something like glue that doesn’t stretch or shrink); 

■ a penalty (representing the undesirability of breaking here). 

The last three types (glue, kern, and penalty items) are called discardable, for the 
same reason that we called them discardable in horizontal lists. You might want to 
compare these specifications with the analogous rules for the horizontal case, found 
in Chapter 14; it turns out that vertical lists are just like horizontal ones except that 
character boxes, discretionary breaks, \vadjust items, and math shifts cannot appear 
in vertical lists. Chapter 12 exhibits a typical vertical list in TjjX’s internal box-and-glue 
representation. 



Page breaks can occur only at certain places within a vertical list. The per- 
missible breakpoints are exactly the same as in the horizontal case, namely 


a) at glue, provided that this glue is immediately preceded by a non-discardable 
item (i.e. , by a box, whatsit, mark, or insertion); 


b) at a kern, provided that this kern is immediately followed by glue; 


c) at a penalty (which might have been inserted automatically in a paragraph). 


Interline glue is usually inserted automatically between the boxes of a vertical list, as 
explained in Chapter 12, so there is usually a valid breakpoint between boxes. 


f As in horizontal lists, each potential breakpoint has an associated penalty, 
which is high for undesirable breakpoints and negative for desirable ones. The 
penalty is zero at glue and kern breaks, so it is nonzero only at explicit penalty breaks. 
If you say ‘\penalty-100’ between two paragraphs, you are indicating that TjgX should 
try to break here because the penalty is negative; a bonus of 100 points for breaking at 
this place will essentially cancel up to 100 units of badness that might be necessary to 
achieve such a break. A penalty of 10000 or more is so large that it inhibits breaking; 
a penalty of —10000 or less is so small that it forces breaking. 
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f Plain IjgX provides several control sequences that help to control page breaks. 

For example, \smallbreak, \medbreak, and \bigbreak specify increasingly 
desirable places to break, having respective penalties of —50, —100, and —200; further- 
more, they will insert a \smallskip, \medskip, or \bigskip of space, respectively, if a 
break is not taken. However, \smallbreak, \medbreak, and \bigbreak do not increase 
existing glue unnecessarily; for example, if you say \smallbreak just after a displayed 
equation, you won’t get a \smallskip of space in addition to the glue that already 
follows a display. Therefore these commands can conveniently be used before and after 
the statements of theorems, in a format for mathematical papers. In the present manual 
the author has used a macro that puts \medbreak before and after every dangerous- 
bend paragraph; \medbreak\medbreak is equivalent to a single \medbreak, so you don’t 
see two medskips when one such paragraph ends and another one begins. 


The \goodbreak macro is an abbreviation for ‘\par\penalty-500’. This is a 
good thing to insert in your manuscript when proofreading, if you are willing 
to stretch some page a little bit extra in order to improve the following one. Later on 
if you make another change so that this \goodbreak command does not appear near 
the bottom of a page, it will have no effect; thus it is not as drastic as \eject. 



f Tlie most interesting macro that plain IjgX provides for page make-up is called 
\filbreak. It means, roughly, “Break the page here and fill the bottom with 
blank space, unless there is room for more copy that is itself followed by \filbreak.” 
Thus if you put \f ilbreak at the end of every paragraph, and if your paragraphs aren’t 
too long, every page break will occur between paragraphs, and TjjX will fit as many 
paragraphs as possible on each page. The precise meaning of \f ilbreak is 


\vf il\penalty-200\vf ilneg 


according to Appendix B; and this simple combination of TgX’s primitives produces 
the desired result: If a break is taken at the \penalty-200, the preceding \vfil will 
fill the bottom of the page with blank space, and the \vfilneg will be discarded after 
the break; but if no break is taken at the penalty, the \vfil and \vfilneg will cancel 
each other and have no effect. 


f Plain TrX also provides a \raggedbottom command, which is a vertical analog 
of \raggedright: It tells TjrX to permit a small amount of variability in the 
bottom margins on different pages, in order to make the other spacing uniform. 

/gb/gb We saw in Chapter 14 that breakpoints for paragraphs are chosen by comput- 
JL JL ing “demerits” for each line and summing them over all lines. The situation 
for pages is simpler because each page is considered separately. T|’X figures the “cost” 
of a page break by using the following formula: 


' p, if b < oo and p < —10000 and q < 10000; 

b + p + q, if b < 10000 and —10000 < p < 10000 and q < 10000; 

100000, if b = 10000 and -10000 < p < 10000 and q < 10000; 

_ oo, if (b = oo or q > 10000) and p < 10000. 


Here b is the badness of the page that would be formed if a break were chosen here; 
p is the penalty associated with the current breakpoint; and q is ‘\insertpenalties’, 
the sum of all penalties for split insertions on the page, as explained below. Vertical 
badness is computed by the same rules as horizontal badness; it is an integer between 
0 and 10000, inclusive, except when the box is overfull, when it is oo (infinity). 
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/gK/gK When a page is completed, it is removed from the main vertical list and passed 
JL JL to an “output routine,” as we will see later; so its boxes and glue eventually 
disappear from TjjX’s memory. The remainder of the main vertical list exists in two 
parts: First comes the “current page,” which contains all the material that TjjX has 
considered so far as a candidate for the next page to be broken off; then there are 
“recent contributions,” i.e., items that will be moved to the current page as soon as 
TgX finds it convenient to do so. If you say \showlists, TjrX will display the contents 
of the current page and the recent contributions, if any, on your log file. (The example 
in Chapter 13 doesn’t show any such lists because they were both empty in that case. 
Chapter 24 explains more about TjrX’s timing.) 

/gN/gN Whenever TjjX is moving an item from the top of the “recent contributions” to 
JL JL the bottom of the “current page,” it discards a discardable item (glue, kern, or 
penalty) if the current page does not contain any boxes. This is how glue disappears at 
a page break. Otherwise if a discardable item is a legitimate breakpoint, T)?X calculates 
the cost c of breaking at this point, using the formula that we have just discussed. If the 
resulting c is less than or equal to the smallest cost seen so far on the current page, TjgX 
remembers the current breakpoint as the best so far. And if c = 00 or if p < —10000, 
TjrX seizes the initiative and breaks the page at the best remembered breakpoint. Any 
material on the current page following that best breakpoint is moved back onto the 
list of recent contributions, where it will be considered again; thus the “current page” 
typically gets more than one page’s worth of material before the breakpoint is chosen. 

/gL/gL This procedure may seem mysterious until you see it in action. Fortunately, 
JL JL there is a convenient way to watch it; you can set \tracingpages=l, thereby 
instructing TjrX to put its page-cost calculations into your log file. For example, here is 
what appeared on the log file when the author used \tracingpages=l at the beginning 
of the present chapter: 


goal height=528. 0, max depth=2.2 
*/. t=10. 0 g=528. 0 b=10000 p=150 c=100000# 

*/. t=22 . 0 g=528. 0 b=10000 p=0 c=100000# 

’/. t=34. 0 g=528. 0 b=10000 p=0 c=100000# 

: (25 similar lines are being omitted here) 

’/. t=346 . 0 plus 2.0 g=528. 0 b=10000 p=0 c=100000# 

*/. t=358 . 0 plus 2.0 g=528. 0 b=10000 p=150 c=100000# 

’/. t=370. 02223 plus 2.0 g=528.0 b=10000 p=-100 c=100000# 

/ t=398.0 plus 5.0 minus 2.0 g=528.0 b=10000 p=0 c=100000# 

'I, t=409.0 plus 5.0 minus 2.0 g=528.0 b=10000 p=0 c=100000# 

•/. t=420 . 0 plus 5.0 minus 2.0 g=528.0 b=10000 p=150 c=100000# 

•/. t=431 . 0 plus 5.0 minus 2.0 g=528.0 b=10000 p=-100 c=100000# 

’/. t=459 . 0 plus 8.0 minus 4.0 g=528.0 b=10000 p=0 c=100000# 

’/. t=470 . 0 plus 8.0 minus 4.0 g=528.0 b=10000 p=0 c=100000# 

'/. t=481 . 0 plus 8.0 minus 4.0 g=528.0 b=10000 p=0 c=100000# 

•/. t=492 . 0 plus 8.0 minus 4.0 g=528.0 b=10000 p=0 c=100000# 

*/, t=503.0 plus 8.0 minus 4.0 g=528.0 b=3049 p=0 c=3049# 

/ t=514.0 plus 8.0 minus 4.0 g=528.0 b=533 p=150 c=683# 

’/, t=525.0 plus 8.0 minus 4.0 g=528.0 b=5 p=-100 c=-95# 

'/, t=553.0 plus 11.0 minus 6.0 g=528.0 b=* p=0 c=* 
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current page 
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tracingpages 
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This trace output, is admittedly not “user-friendly” in appearance, but after all it comes 
from deep inside TjjX’s bowels where things have been reduced to numeric calculations. 
You can learn to read it with a little practice, but you won’t need to do so very often 
unless you need to plunge into page-breaking for special applications. Here’s what it 
means: The first line, which starts with is written when the first box or insertion 
enters the current page list; it shows the “goal height” and the “max depth” that will 
be used for that page (namely, the current values of \vsize and \maxdepth). In the 
present manual we have \vsize=44pc and \maxdepth=2 . 2pt; dimensions in the log file 
are always displayed in points. The subsequent lines, which start with a single are 
written whenever a legal breakpoint is being moved from the list of recent contributions 
to the current page list. Every '/, line shows t, which is the total height so far if a page 
break were to occur, and g , which is the goal height; in this example g stays fixed at 
528 pt, but g would have decreased if insertions such as footnotes had occurred on the 
page. The values of t are steadily increasing from 10 to 22 to 34, etc.; baselines are 
12 pt apart at the top of the page and 11 pt apart at the bottom (where material is 
set in nine-point type). We are essentially seeing one ’/, line per hbox of text being 
placed on the current page. However, the ’/, lines are generated by the penalty or glue 
items that follow the hboxes, not by the boxes themselves. Each 1 line shows also the 
badness 6, the penalty p, and the cost c associated with a breakpoint; if this cost is the 
best so far, it is marked with a “#’ sign, meaning that “this breakpoint will be used for 
the current page if nothing better comes along.” Notice that the first 40 or so breaks 
all have b = 10000, since they are so bad that TjrX considers them indistinguishable; in 
such cases c = 100000, so TjrX simply accumulates material until the page is full enough 
to have b < 10000. A penalty of 150 reflects the \clubpenalty or the \widowpenalty 
that was inserted as described in Chapter 14. The three lines that say p=-100 are 
the breakpoints between “dangerous bend” paragraphs; these came from \medbreak 
commands. The notation b=* and c=* on the final line means that b and c are infinite; 
the total height of 553 pt cannot be reduced to 528 pt by shrinking the available glue. 
Therefore the page is ejected at the best previous place, which turns out to be a pretty 
good break: b=5 and p=-100 yield a net cost of —95. 

EXERCISE 15.1 

JL JL Suppose the paragraph at the bottom of the example page had been one line 
shorter; what page break would have been chosen? 

(£)(£)*■ EXERCISE 15.2 

JL JL The last two lines” of this example show the natural height of t, jumping 
by 28 pt, from 525.0 to 553.0. Explain why there was such a big jump. 

/gb/gb The \maxdepth parameter tells TpX to raise the bottom box on the page if 
JL JL that box has too much depth, so that the depth of the constructed page will 
not exceed a specified value. (See the discussion of \boxmaxdepth in Chapter 12.) In 
our example \maxdepth=2 . 2pt, and the influence of this parameter can be seen in the 
line that says 7, t=370 . 02223’. Ordinarily f would have been 370.0 at that breakpoint; 
but the hbox preceding it was unusual because it contained the letter j in \tt, and 
a 10-point typewriter-style j descends 2.22223 pt below the baseline. Therefore Tj?X 
figured badness as if the hbox were .02223 pt higher and only 2.2 pt deep. 



Notice that the first “’/, line” of our example says t=10.0; this is a consequence 
of another parameter, called \topskip. Glue disappears at a page break, but 
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it is desirable to produce pages whose top and bottom baselines occur in predetermined 
positions, whenever possible; therefore TpX inserts special glue just before the first box 
on each page. This special glue is equal to \topskip, except that the natural space 
has been decreased by the height of the first box, or it has been set to zero in lieu 
of a negative value. For example, if \topskip=20pt plus2pt, and if the first box on 
the current page is 13 pt tall, TjjX inserts ‘\vskip7pt plus2pt’ just above that box. 
Furthermore, if the first box is more than 20 pt tall, ‘\vskipOpt plus2pt’ is inserted. 
But this example is atypical, since the \topskip glue usually has no stretchability or 
shrinkability; plain Tj?X sets \topskip=10pt. 

(£)(£)*■ EXERCISE 15.3 

JL JL Assume that \vsize=528pt, \maxdepth=2 . 2pt, \topskip=10pt, and that no 
\insert commands are being used. IjjX will make pages that are 528 pt high, and the 
following two statements will normally be true: (a) The baseline of the topmost box 
on the page will be 10 pt from the top, i.e., 518 pt above the baseline of the page itself, 
(b) The baseline of the bottommost box on the page will coincide with the baseline of 
the page itself. Explain under what circumstances (a) and (b) will fail. 

/>)/>) Since \vsize, \maxdepth, and \topskip are parameters, you can change them 
JL JL at any time; what happens if you do? Well, TjgX salts away the values of 
\vsize and \maxdepth when it prints the line,” i.e., when the first box or insertion 
occurs on the current page; subsequent changes to those two parameters have no effect 
until the next current page is started. On the other hand, TgX looks at \topskip only 
when the first box is being contributed to the current page. If insertions occur before 
the first box, the \topskip glue before that box is considered to be a valid breakpoint; 
this is the only case in which a completed page might not contain a box. 

/>)/>) You can look at the t and g values that are used in page breaking by referring 
JL JL to the {dimen) values ‘\pagetotal’ and ‘\pagegoal’, respectively. You can 
even change them (but let’s hope that you know what you are doing). For example, 
the command \pagegoal=500pt overrides the previously saved value of \vsize. Be- 
sides \pagetotal, which represents the accumulated natural height, TjrX maintains the 
quantities \pagestretch, \pagef ilstretch, \pagef illstretch, \pagef illlstretch, 
\pageshrink, and \pagedepth. When the current page contains no boxes, \pagetotal 
and its relatives are zero and \pagegoal is 16383.99998 pt (TeX’s largest (dimen)); 
changing their values has no effect at such times. The integer q in the formula for page 
costs is also available for inspection and change; it is called \insertpenalties. 

/gb/gb Page breaking differs from line breaking in one small respect that deserves 
JL JL mention here: If you say \e ject\e ject, the second \eject is ignored, because 
it is equivalent to \penalty-10000 and penalties are discarded after a page break. But if 
you say \break\break in a paragraph, the second \break causes an empty line, because 
penalties are discarded after a break in a paragraph only if they do not belong to the 
final sequence of breakpoints. This technicality is unimportant in practice, because 
\break\break isn’t a good way to make an empty line; that line will usually be an 
underfull hbox, since it has only the \leftskip and \rightskip glue in it. Similarly, 
‘\eject\eject’ would not be a good way to make an empty page, even if IjgX were to 
change its rules somehow so that an \eject would never be ignored. The best way to 
eject an empty page is to say ‘\eject\line{}\vfil\eject’, and the best way to create 
an empty line is ‘\break\hbox{}\hf il\break’. Both of these avoid underfull boxes. 
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f You are probably wondering how page numbers and such things get attached 
to pages. The answer is that Tf-'X allows you to do further processing after 
each page break has been chosen; a special “output routine” goes into action before 
pages actually receive their final form. Chapter 23 explains how to construct output 
routines and how to modify the output routine of plain 1)?X. 

f Every once in a while, TpjX will produce a really awful-looking page and you 
will wonder what happened. For example, you might get just one paragraph 
and a lot of white space, when some of the text on the following page would easily fit 
into the white space. The reason for such apparently anomalous behavior is almost 
always that no good page break is possible; even the alternative that looks better to 
you is quite terrible as far as Tj^X is concerned! TjXX does not distinguish between two 
choices that both have 10000 units of badness or more, even though some bad breaks 
do look much worse than others. The solution in such cases is to insert \eject or 
\vfill\eject in some acceptable spot, or to revise the manuscript. If this problem 
arises frequently, however, you probably are using a format that sets overly strict 
limitations on page format; try looking at the output of \tracingpages and modifying 
some of TjgX’s parameters, until you have better luck. 

f Tlie remainder of this chapter is about insertions: things like footnotes and 
illustrations, and how they interact with page breaks. Before we discuss the 
primitive operations by which Tj^X deals with insertions, we will take a look at the 
facilities that plain provides at a higher level. 

f Illustrations can be inserted in several ways using plain TjrX. The simplest of 
these is called a “floating topinsert” ; you say 

\topinsert(vertical mode material)\endinsert 

and Ij-yX will attempt to put the vertical mode material at the top of the current page. 
If there’s no room for such an insertion on one page, TjgX will insert it at the top of 
the next page. The (vertical mode material) can contain embedded paragraphs that 
temporarily interrupt vertical mode in the usual way; for example: 

\topinsert \vskip 2in 
\hsize=3in \raggedright 

\noindent{\bf Figure 3.} This is the caption to the 
third illustration of my paper. I have left two inches 
of space above the caption so that there will be room 
to introduce special artwork, \endinsert 

The caption in this example will be set ragged-right in a 3-inch column at the left of 
the page. Plain TpX automatically adds a “bigskip” below each topinsert; this will 
separate the caption from the text. The effects of \hsize=3in and \raggedright do 
not extend past the \endinsert, since grouping is implied. 

►EXERCISE 15.4 

JL Modify this example so that the caption is moved over next to the right margin, 
instead of appearing at the left. 

f Similarly, if you say ‘\pageinsert (vertical mode material) \endinsert’, the 
vertical mode material will be justified to the size of a full page (without a 
bigskip below it); the result will appear on the following page. 
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f There’s also ‘\midinsert (vertical mode material) \endinsert’, which tries 
first to insert the material in place, wherever you happen to be, in the middle 
of the current page. If there is enough room, you get the effect of 

\bigskip\vbox{ (vertical mode material) }\bigbreak 


otherwise the \midinsert is effectively converted to a \topinsert. There is a slight 
probability that \midinsert will not find the best placement, because TpX is sometimes 
processing text ahead of the current page. You may want to say ‘\goodbreak’ just 
before \midinsert. 


f You should use the commands \topinsert, \pageinsert, \midinsert in ver- 
tical mode (i.e., between paragraphs), not inside of boxes or other insertions. 

f lf you have two or more \topinsert or \pageinsert commands in quick suc- 
cession, T)rX may need to carry them over to several subsequent pages; but 
they will retain their relative order when they are carried over. For example, suppose 
you have pages that are nine inches tall, and suppose you have already specified 4 inches 
of text for some page, say page 25. Then suppose you make seven t.opinserts in a row, of 
respective sizes 1, 2, 3, 9, 3, 2, 1 inches; the 9-inch one is actually a \pageinsert. What 
happens? Well, the first, and second will appear at the top of page 25, followed by the 
4 inches of copy you have already typed; that copy will immediately be followed by two 
more inches that you type after the seven inserts. The third t.opinsert will appear at 
the top of page 26, followed by six more inches of text; the fourth will fill page 27; and 
the remaining three will appear at the top of page 28. 

►EXERCISE 15.5 

JL What would happen in the example just discussed if the final 1-inch insertion 
were a \midinsert instead of a \topinsert? 

f At the end of a paper, you probably want to make sure that no insertions 
are lost; and at the end of a chapter, you probably want to make sure that 
no insertions float into the following chapter. Plain TjjX will flush out all remain- 
ing insertions, with blank space filling the bottom of incomplete pages, if you say 
‘\vf ill\supere ject’. 

f Besides illustrations that are inserted at the top of a page, plain TjgX will also 
insert footnotes at the bottom of a page. The \footnote macro is provided 
for use within paragraphs;* for example, the footnote in the present sentence was typed 
in the following way: 


... paragraphs ; \footnote*{Like this.} for example, ... 

There are two parameters to a \footnote; first comes the reference mark, which will 
appear both in the paragraph** and in the footnote itself, and then comes the text of 
the footnote. 46 The latter text may be several paragraphs long, and it may contain 
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* Like this. 

** The author typed ‘paragraph\f ootnote{**MThe author . . .}’ here. 

46 And ‘footnote . \footnote{$~{45}$}{And . . .}’ here. The footnotes in this manual 
appear in smaller type, and they are set with hanging indentation; furthermore a 
smallskip occurs between footnotes on the same page. But in plain Tp]X, footnotes 
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displayed equations and such things, but it should not involve other insertions. TjyX 
will ensure that each footnote occurs at the bottom of the same page as its reference, f 
A long footnote will be split, if necessary, and continued at the bottom of the following 
page, as you can see in the somewhat contrived example that appears here. Authors 
who are interested in good exposition should avoid footnotes whenever possible, since 
footnotes tend to be distracting 4 

f Tlie \footnote macro should be used only in paragraphs or hboxes that are 
contributed to TftX’s main vertical list; insertions will be lost if they occur 
inside of boxes that are inside of boxes. Thus, for example, you should not try to put 
a \footnote into a subformula of a math formula. But it’s OK to use footnotes within 
\centerline, e.g., 

\centerline{A paper by A. U. Thor'/, 

\footnote*{Supported by NSF.}} 

or even on the outer level of a table entry inside an \halign. 

(&)&) Topinserts work fine by themselves, and footnotes work fine by themselves, 
JL JL but complications can arise when you try to mix them in devious ways. For 
example, if a \page insert floats to the page that follows a long footnote that had 
to be broken, both of the held-over insertions may try to force themselves onto the 
same page, and an overfull vbox may result. Furthermore, insertions cannot appear 
within insertions, so you can’t use \footnote within a \topinsert. If you really need 
a footnote in some caption, there’s a \vfootnote macro that can be used in vertical 
mode. To use it, you put a reference mark like in the caption, and then you say 
‘\vf ootnote*{The footnote}’ somewhere on the page where you guess that the caption 
will finally fall. In such complex circumstances you might want to rethink whether or 
not you are really using the most appropriate format for the exposition of your ideas. 

/gb/gb Chapter 24 explains the exact rules about migration of vertical-mode material 
JL JL (like footnotes) from horizontal lists to the enclosing vertical list. Insertions, 
marks, and the results of \vadjust all migrate in the same fashion. 

f Now let’s study the primitives of T|-X that are used to construct macros like 
\topinsert and \footnote. We are about to enter behind the scenes into a 
sublanguage of T}rX that permits users to do complex manipulations with boxes and 
glue. Our discussion will be in two parts: First we shall consider TgX’s “registers,” 
with which a user can do arithmetic related to typesetting; and then we shall discuss 
the insertion items that can appear in horizontal and vertical lists. Our discussion 
of the first topic (registers) will be marked with single dangerous-bend signs, since 
registers are of general use in advanced applications of whether or not they relate 
to insertions. But the second topic will be marked with double dangerous-bend signs, 
since insertions are rather esoteric. 


are typeset with the normal size of type, with \text indent used for the reference 
mark, and without extra smallskips. The \textindent macro is like \item, but it 
omits hanging indentation. 

f Printers often use the symbols \dag (f), \ddag (}), \S (§), and \P (^[) as reference 
marks; sometimes also $\ I $ ([]). You can say, e.g., ‘\footnote\dag{. . .}’. 

I Yet Gibbon’s Decline and Fall would not have been the same without footnotes. 
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f Tj^X has 256 registers called \countO to \count255, each capable of containing 
integers between —2147483647 and +2147483647, inclusive; i.e., the magni- 
tudes should be less than 2 31 . TgX also has 256 registers called \dimenO to \dimen255, 
each capable of containing a {dimen) (see Chapter 10). There are another 256 registers 
called \skipO to \skip255, each containing (glue) (see Chapter 12); and \muskipO to 
\muskip255, each containing (muglue) (see Chapter 18). You can assign new values to 
these registers by saying 

\ count (number) = (number) 

\dimen(number) = (dirnen) 

\ skip (number) = (glue) 

\muskip (number) = (muglue) 

and then you can add or subtract values of the same type by saying 

\advance\count(number) by (number) 

\advance\dimen(number) by (dirnen) 

\advance\skip(number) by (glue) 

\advance\muskip(number) by (muglue) 


For example, ‘\dimen8=\hsize \advance\dimen8 by lin’ sets register \dimen8 to an 
inch more than the current value of the normal line size. 



If infinite glue components are added, lower order infinities disappear. For 
example, after the two commands 

\skip2 = Opt plus 2fill minus 3fill 
\advance\skip2 by 4pt plus lfil minus 2filll 


the value of \skip2 will be 4pt plus 2 fill minus 2 filll. 

f Multiplication and division are possible too, but only by integers. For example, 
‘\multiply\dimen4 by 3’ triples the value of \dimen4, and ‘\divide\skip5 
by 2’ cuts in half all three components of the glue that is currently registered in \skip5. 
You shouldn’t divide by zero, nor should you multiply by numbers that will make the 
results exceed the register capacities. Division of a positive integer by a positive integer 
discards the remainder, and the sign of the result changes if you change the sign of 
either operand. For example, 14 divided by 3 yields 4; —14 divided by 3 yields —4; 
— 14 divided by —3 yields 4. Dimension values are integer multiples of sp (scaled points). 

f You can use any \ count register in the context of a (number), any \dimen 
register in the context of a (dirnen), any \skip register in the context of (glue), 
and any \muskip register in the context of (muglue). For example, ‘\hskip\skipl’ puts 
horizontal glue into a list, using the value of \skipl; and if \count5 is 20, the command 
‘\advance\dimen20 by\dimen\count5’ is equivalent to ‘\multiply\dimen20 by 2’. 

f A \dimen register can be used also in the context of a (number), and a \skip 
register can be used as a (dirnen) or a (number). T)?X converts (glue) to 
(dirnen) by omitting the stretch and shrink components, and it converts (dirnen) to 
(number) by assuming units of sp (scaled points). For example, if \skipl holds the 
value 1 pt plus 2pt, then ‘\dimenl=\skipl’ sets \dimenl equal to 1 pt; and the com- 
mands l \count2=\dimenl’ or l \count2=\skipl’ will set \count2 equal to 65536. These 
rules also apply to TgX’s internal parameters; for example, ‘\dimen2=\baselineskip’ 
will set \dimen2 to the natural space component of the current baselineskip glue. 
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/>, ►EXERCISE 15.6 

JL Test your knowledge of TjgX’s registers by stating the results of each of the 
following commands when they are performed in sequence: 


\countl=50 \dimen2=\countlpt \divide\countl by 8 
\skip2=-10pt plus\countlf il minus\dimen2 
\multiply\skip2 by-\countl \divide\skip2 by \dimen2 
\skipl= . 5\dimen2 plus\skip2 minus \count\ count If ill 
\multiply\skip2 by\skipl \advance\skipl by-\skip2 


\count6=\skip2 


/£> ►EXERCISE 15.7 

JL What is in \skip5 after the following three commands have acted? 

\skip5=0pt plus lpt 

\advance\skip5 by \skip4 \advance\skip5 by -\skip4 
►EXERCISE 15.8 

JL (For mathematicians.) Explain how to round \dimen2 to the nearest multiple 
of \dimen3, assuming that \dimen3 is nonzero. 

f The registers obey TjgX’s group structure. For example, changes to \count3 
inside {...} will not affect the value of \count3 outside. Therefore TjjX 
effectively has more than 256 registers of each type. If you want the effect of a register 
command to transcend its group, you must say \global when you change the value. 

►EXERCISE 15.9 

JL What is in \ count 1 after the following sequence of commands? 

\countl=5 {\countl=2 \global\advance\countlby\countl 
\advance\countlby\countl} 

f The first ten \count registers, \countO through \count9, are reserved for a 
special purpose: IjgX displays these ten counts on your terminal whenever 
outputting a page, and it transmits them to the output file as an identification of that 
page. The counts are separated by decimal points on your terminal, with trailing 1 . O’ 
patterns suppressed. Thus, for example, if \count0=5 and \count2=7 when a page is 
being shipped out to the dvi file, and if the other count registers are zero, Tj^X will 
type ‘[5.0.7]’. Plain IjjX uses \countO for the page number, and it keeps \countl 
through \count9 equal to zero; that is why you see just ‘[1]’ when page 1 is being 
output. In more complex applications the page numbers can have further structure; 
ten counts are shipped out so that there will be plenty of identification. 

f it’s usually desirable to have symbolic names for registers. Tj?X provides a 
\countdef command (similar to \chardef, cf. Chapter 8), which makes it 
easy to do this: You just say 


round 

group structure 

global 

dvi 

[i] 

countdef 

dimendef 

skipdef 

muskipdef 

parameter 


\ countdef \chapno=28 


and \chapno is henceforth an abbreviation for \ count 28. Similar commands \dimendef , 
\skipdef , and \muskipdef are available for the other types of numeric registers. After 
a control sequence has been defined by \countdef, it can be used in TjjX commands 
exactly as if it were an integer parameter like \tolerance. Similarly, \dimendef ef- 
fectively creates a new dimension parameter, \ skipdef effectively creates a new glue 
parameter, and \muskipdef effectively creates a new niuglue parameter. 
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f Besides the numerical registers, TjjX also has 256 box registers called \boxO to 
\box255. A box register gets a value when you say \setbox(number)=(box); 
for example, ‘\setbox3=\hbox{A}’ sets \box3 to an hbox that contains the single let- 
ter A. Several other examples of \setbox have already appeared in Chapter 12. Chap- 
ter 10 points out that ‘2\wd3’ is a {dimen) that represents twice the width of \box3; 
similarly, \ht (number) and \ dp (number) can be used to refer to the height and depth 
of a given box register. 

f liox registers are local to groups just as arithmetic registers are. But there’s a 
big difference between box registers and all the rest: When you use a \box, it 
loses its value. For example, the construction ‘\raise2pt\box3’ in a horizontal list not 
only puts the contents of \box3 into the list after raising it by 2 pt, it also makes \box3 
void. Tj^X does this for efficiency, since it is desirable to avoid copying the contents 
of potentially large boxes. If you want to use a box register without wiping out its 
contents, just say ‘\copy’ instead of ‘\box’; for example, ‘\raise2pt\copy3’. 


Another way to use a box register is to extract the inside of an hbox by saying 
‘\unhbox’. This annihilates the contents of the register, like ‘\box’ does, and 
it also removes one level of boxing. For example, the commands 



\setbox3=\hbox{A} \setbox3=\hbox{\box3 B} 
\setbox4=\hbox{A} \setbox4=\hbox{\unhbox4 B} 


put \hbox{\hbox{A}B} into \box3 and \hbox{AB} into \box4. Similarly, \unvbox un- 
wraps a vbox. If you want to construct a large box by accretion (e.g., a table of 
contents), it is best to use \unhbox or \unvbox as in the \setbox4 example; otherwise 
you use more of TjrX’s memory space, and you might even obtain boxes inside boxes 
nested to such a deep level that hardware or software limits are exceeded. 



The operations \unhcopy and \unvcopy are related to \unhbox and \unvbox 
as \copy is to \box. (But their names are admittedly peculiar.) 

An unboxing operation “unsets” any glue that was set at the box’s outer level. 
For example, consider the sequence of commands 


\setbox5=\hbox{A \hbox{B C}} \setbox6=\hbox to 1 . 05\wd5{\unhcopy5} 


This makes \box6 five percent wider than \box5; the glue between A and \hbox{B C} 
stretches to make the difference, but the glue inside the inner hbox does not change. 

f A box register is either “void” or it contains an libox or a vbox. There is a 
difference between a void register and one that contains an empty box whose 
height, width, and depth are zero; for example, if \box3 is void, you can say \unhbox3 
or \unvbox3 or \unhcopy3 or \unvcopy3, but if \box3 is equal to \hbox{} you can say 
only \unhbox3 or \unhcopy3. If you say ‘\global\setbox3=(box)’, register \box3 will 
become “globally void” when it is subsequently used or unboxed. 



► EXERCISE 15.10 

Wliat is in register \box5 after the following commands? 



\setbox5=\hbox{A} \setbox5=\hbox{\copy5\unhbox5\box5\unhcopy5} 

► EXERCISE 15.11 

And what’s in \box3 after l {\global\setbox3=\hbox{A}\setbox3=\hbox{}}’ ? 
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If you are unsure about how T)gX operates on its registers, you can experiment 
online by using certain ‘\show’ commands. For example, 

\showthe\ count 1 \showthe\dimen2 \showthe\skip3 

will display the contents of \countl, \dimen2, and \skip3; and ‘\showbox4’ will dis- 
play the contents of \box4. Box contents will appear only in the log file, unless you 
say ‘\tracingonline=r. Plain l)gX provides a macro ‘\tracingall’ that turns on 
every possible mode of interaction, including \tracingonline. The author used these 
features to check the answers to several of the exercises above. 

Large applications of TgX make use of different sets of macros written by 
different groups of people. Chaos would reign if a register like \countl00, say, 
were being used simultaneously for different purposes in different macros. Therefore 
plain TjgX provides an allocation facility; cooperation will replace confusion if each 
macro writer uses these conventions. The idea is to say, e.g., ‘\newcount’ when you 
want to dedicate a \count register to a special purpose. For example, the author 
designed a macro called ‘\exercise’ to format the exercises in this manual, and one of 
the features of \exercise is that it computes the number of the current exercise. The 
format macros in Appendix E reserve a \ count register for this purpose by saying 

\newcount\exno 

and then the command ‘\exno=0’ is used at the beginning of each chapter. Similarly, 
‘\advance\exno byl’ is used whenever a new exercise conies along, and ‘\the\exno’ 
is used to typeset the current exercise number. The \newcount operation assigns a 
unique count register to its argument \exno, and it defines \exno with a \countdef 
command. All of the other format macros are written without the knowledge of exactly 
which \count register actually corresponds to \exno. 

Besides \newcount, plain T)gX provides \newdimen, \newskip, \newmuskip, 
and \newbox; there also are \newtoks, \newread, \newwrite, \newfam, and 
\newinsert, for features we haven’t discussed yet. Appendices B and E contain sev- 
eral examples of the proper use of allocation. In the cases of \newbox, \newread, 
etc., the allocated number is defined by \chardef. For example, if the command 
‘\newbox\abstract’ is used to define a box register that will contain an abstract, 
and if the \newbox operation decides to allocate \box45 for this purpose, then it 
defines the meaning of \abstract by saying ‘\chardef\abstract=45’. TfgX allows 
Xchardef’d quantities to be used as integers, so that you can say \box\abstract and 
\copy\abstract, etc. (There is no \boxdef command.) 

► EXERCISE 15.12 

Design a \note macro that produces footnotes numbered sequentially. For 
example, 1 it should produce the footnotes here 2 if you type 

... example , \note{First note.} it should produce 
the footnotes here\note{Second note.} if ... 

(Use \newcount to allocate a \count register for the footnotes.) 
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1 First note. 

2 Second note. 
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Sometimes, however, you want to use a register just for temporary storage, 
and you know that it won’t conflict with anybody else’s macros. Registers 
\count255, \dimen255, \skip255, and \muskip255 are traditionally kept available for 
such purposes. Furthermore, plain TjrjX reserves \dimenO to \dimen9, \skipO to \skip9, 
\muskipO to \muskip9, and \boxO to \box9 for “scratchwork” ; these registers are never 
allocated by the \new. . . operations. We have seen that \countO through \count9 are 
special, and \box255 also turns out to be special; so those registers should be avoided 
unless you know what you are doing. 



/gb/gb Of course any register can be used for short-term purposes inside a group 
JL JL (including \countO to \count9 and \box255, and including registers that have 
been allocated for other purposes), since register changes are local to groups. However, 
you should be sure that TfgX will not output any pages before the group has ended, 
because output routines might otherwise be invoked at unfortunate times. TjrX is liable 
to invoke an output routine whenever it tries to move something from the list of recent 
contributions to the current page, because it might discover a page break with c = oo 
then. Here is a list of the times when that can happen: (a) At the beginning or end of a 
paragraph, provided that this paragraph is being contributed to the main vertical list, 
(b) At the beginning or end of a displayed equation within such a paragraph, (c) After 
completing an \halign in vertical mode, (d) After contributing a box or penalty or 
insertion to the main vertical list, (e) After an \output routine has ended. 


/gb/gb Now that we are armed with the knowledge of TjrjX’s flexible registers, we 
JL JL can plunge into the details of insertions. There are 255 classes of insertions, 
\insertO to \insert254, and they are tied to other registers of the same number. 
For example, \insertlOO is connected with \countl00, \dimenl00, \skiplOO, and 
\boxl00. Therefore plain TjjX provides an allocation function for insertions as it does 
for registers; Appendix B includes the command 


\newinsert\footins 


which defines \footins as the number for footnote insertions. Other commands that 
deal with footnotes refer to \count\footins, \dimen\footins, and so on. The macros 
for floating topinserts are similarly prefaced by ‘\newinsert\topins’, which defines 
\topins as the number of their class. Each class of insertions is independent, but Tj/X 
preserves the order of insertions within a class. It turns out that \footins is class 254, 
and \topins is class 253, but the macros do not use such numbers directly. 

/gb/gb For our purposes let’s consider a particular class of insertions called class n; 
JL JL we will then be dealing with IjvX’s primitive command 

\ insert n{ (vertical mode material)} 

which puts an insertion item into a horizontal or vertical list. For this class of insertions 

\box n is where the material appears when a page is output; 

\ count n is the magnification factor for page breaking; 

\dimen n is the maximum insertion size per page; 

\skip n is the extra space to allocate on a page. 

For example, material inserted with \insertl00 will eventually appear in \boxlOO. 


groups 

output routines, when invoked 
page builder, when exercised 
newinsert 
insert 
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/gK/gK Let the natural height plus depth of \insertn be x\ then \count n is 1000 
JL JL times the factor by which x affects the page goal. For example, plain TpX sets 
\count\f ootins=1000, since there is a one-to-one relationship: A 10-point footnote 
effectively makes a page 10 pt shorter. But if we have an application where footnotes 
appear in double columns, a count value of 500 would be appropriate. One of the 
insertion classes in Appendix E makes marginal notes for proofreading purposes; in 
that case the count value is zero. No actual magnification is done; \ count n is simply 
a number used for bookkeeping, when estimating the costs of various page breaks. 

/gN/gN The first footnote on a page requires extra space, since we want to separate 
JL JL the footnotes from the text, and since we want to output a horizontal rule. 
Plain T)?X sets ‘\skip\f ootins=\bigskipamount’; this means that a bigskip of extra 
space is assumed to be added by the output routine to any page that contains at least 
one insertion of class \footins. 

/>)/>) Sometimes it is desirable to put a maximum limitation on the size of insertions; 
JL JL for example, people usually don’t want an entire page to consist of footnotes. 
Plain TjrX sets \dimen\f ootins=8in; this means that \box\footins is not supposed 
to accumulate more than 8 inches of footnotes for any one page. 

/gL/gL You might want to review the page-breaking algorithm explained at the be- 
JL JL ginning of this chapter, before reading further. On the other hand, maybe you 
don’t really want to read the rest of this chapter at all, ever. 

/gL/gL Here now is the algorithm that TgX performs when an \ insert n is moved 
JL JL from the “recent contributions” to the “current page.” (Remember that such 
a move does not mean that the insertion will actually take place; the current page will 
be backed up later, to the breakpoint of least cost, and only the insertions preceding 
that breakpoint will actually be performed.) Let g and t be the current \pagegoal 
and \pagetotal; let q be the \insertpenalties accumulated for the current page; 
and let d and s be the current \pagedepth and \page shrink. (The value of d is at 
most \maxdepth; this value has not yet been incorporated into t.) Finally, let x be 
the natural height plus depth of the \ insert n that we are moving to the current page; 
and let / be the corresponding magnification factor, i.e., \count n divided by 1000. 

Step 1. If there is no previous \insert n on the current page, decrease g by hf + w, 
where h is the current height plus depth of \box n, and where w is the natural space 
component of \skip n; also include the stretch and shrink components of \skip n in 
the totals for the current page (in particular, this affects z). 

Step 2. If a previous \insert n on the current page has been split, add the parameter 
called \f loatingpenalty to q, and omit Steps 3 and 4. 


Step 3. Test if the current insertion will fit on the page without splitting. This means 
that it will not make the height plus depth of \box n surpass \dimenn, when it is 
added to \boxn together with all previous \ insert n amounts on the current page; 
furthermore, it means that either xf < 0 or t + d + xf — z < g. If both tests are passed, 
subtract xf from g and omit Step 4. 

Step 4. (The current insertion will be split, at least tentatively; but the split will not 
actually take place if the least-cost page turns out to have occurred earlier than the 
present insertion.) First compute the largest amount v such that a height plus depth 
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of v will not make the total insertions into \box n bigger than \dimen n, and such that 
t + d + vf < g. (Notice that z is omitted from the latter formula, but the available 
shrinkability was considered in Step 3 when we tried to avoid splitting.) Then find the 
least-cost way to split the beginning of the vertical list of the insertion so as to obtain a 
box of height v. (Use an algorithm just like page-breaking, but without the complexity 
of insertion; an additional ‘\penalty- 10000’ item is assumed to be present at the end of 
the vertical list, to ensure that a legal breakpoint exists.) Let u be the natural height 
plus depth of that least-cost box, and let r be the penalty associated with the optimum 
breakpoint. Decrease g by i if, and increase q by r. (If \tracingpages=l, the log file 
should now get a cryptic message that says 7, split n to v,u p =r\ For example, 


7. split254 to 180.2,175.3 p=100 

means that the algorithm has tried to split an \insert254 to 180.2 pt; the best split is 
actually 175.3 pt tall, and the penalty for breaking there is 100.) 

/gb/gb This algorithm is admittedly complicated, but no simpler mechanism seems 
JL JL to do nearly as much. Notice that penalties of —10000 inside insertions will 
make certain splits very attractive in Step 4, so the user can provide hints about 
where to break, in difficult situations. The algorithm provides a variety of different 
behaviors: Floating insertions can be accommodated as a special case of split insertions, 
by making each floating topinsert start with a small penalty, and by having zero as the 
associated \floatingpenalty; non-floating insertions like footnotes are accommodated 
by associating larger penalties with split insertions (see Appendix B). 

/>)/>) The splitting operation mentioned in Step 4 is also available as a primitive: 
JL JL l \vsplit(number) to(dimen)’ produces a vbox obtained by splitting off a 
specified amount of material from a box register. For example, 


\setbox200=\vsplitl00 to 50pt 

sets \box200 to a vbox whose height is 50 pt; it goes through the vertical list inside 
\boxl00 (which should be a vbox) and finds the least-cost, break assuming a goal height 
of 50 pt, considering badnesses and penalties just as in the case of page-breaking (but 
with q = 0). The algorithm uses \splitmaxdepth instead of \maxdepth to govern 
the maximum depth of boxes. Then it prunes the top of \boxl00 by removing every- 
thing up to and including any discardable items that immediately follow the optimum 
breakpoint; and it uses \splittopskip to insert new glue before the first box inside 
\boxl00, just as \topskip glue appears at the top of a page. However, if the optimum 
breakpoint occurs at the end of the vertical list inside \boxl00 — a ‘\penalty-10000’ 
item is assumed to be present there — or if all items after the optimum breakpoint are 
discarded, \boxl00 will be void after the \vsplit. And if \boxl00 was void before the 
\vsplit, both \boxl00 and \box200 will be void afterwards. 

/>)/>) You’d better not change \box?i, \count n, \dimen n, or \skip n while TjjX is 
JL JL contributing insertions to the current page, since TjjX’s algorithm assumes that 
those quantities are static. But you can change \f loatingpenalty, \splittopskip, 
and \splitmaxdepth; Tj^X will use the values that were current just inside the closing 
right brace of ‘\insert n{. . when it splits and floats insertions. For example, Ap- 
pendix B uses \floatingpenalty=20000 in footnote insertions, to discourage footnotes 
that split before others can start, but \f loatingpenalty=0 in floating topinserts. Ap- 
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pendix B also uses special values of \splittopskip and \splitmaxdepth, together with 
struts, so that split footnotes will be typeset with the same spacing as unsplit ones. 


/>)/>) The \footnote macro puts an \insert into the horizontal list of a paragraph. 
JL JL After the paragraph has been broken into lines, this insertion will move out 
into the vertical list just after the line that contained it (see Chapter 14). Since there 
is no legal breakpoint between that box (i.e. , that line) and the insertion, TjjX will put 
the insertion onto the page that contains the line that contains the insertion. 


^►EXERCISE 15.13 

Study the page-breaking algorithm carefully, 
might not appear on the same page as its reference? 


Is it possible that a footnote 


/>>/>) When the best page break is finally chosen, TjgX removes everything after 
JL JL the chosen breakpoint from the bottom of the “current page,” and puts it 
all back at the top of the “recent contributions.” The chosen breakpoint itself is 
placed at the very top of the recent contributions. If it is a penalty item, the value 
of the penalty is recorded in \outputpenalty and the penalty in the contribution list 
is changed to 10000; otherwise \outputpenalty is set to 10000. The insertions that 
remain on the current page are of three kinds: For each class n there are unsplit 
insertions, followed possibly by a single split insertion, followed possibly by others. If 
\holdinginserts >0, all insertions remain in place (so that they might be contributed 
again); otherwise they are all removed from the current page list as follows: The unsplit 
insertions are appended to \box n, with no interline glue between them. (Struts should 
be used, as in the \vfootnote macro of Appendix B.) If a split insertion is present, it 
is effectively \vsplit to the size that was computed previously in Step 4; the top part 
is treated as an unsplit insertion, and the remainder (if any) is converted to an insertion 
as if it had not been split. This remainder, followed by any other floating insertions of 
the same class, is held over in a separate place. (They will show up on the “current 
page” if \showlists is used while an \output routine is active; the total number of 
such insertions appears in \insertpenalties during an \output routine.) Finally, the 
remaining items before the best break on the current page are put together in a \vbox 
of height g , where g was the \pagegoal at the time of the break, using the saved value 
of \maxdepth; this box becomes \box255. Now the user’s \output routine enters T^X’s 
scanner (see Chapter 23); its duty is to assemble the final pages based on the contents of 
\box255 and any insertion boxes that it knows about. The output routine will probably 
unbox those boxes, so that their glue can be reset; the glue in insertion boxes usually 
cooperates nicely with the glue on the rest of the page, when it is given a chance. After 
the \ output routine is finished, lield-over insertion items are placed first on the list of 
recent contributions, followed by the vertical list constructed by \output, followed by 
the recent contributions beginning with the page break. (Deep breath. ) You got that? 


Since it is impossible to foresee how [footnotes] will happen to come out 
in the make-up, it is impracticable to number them from 1 up on each page. 
The best way is to number them consecutively throughout an article 

or by chapters in a book. 
— UNIVERSITY OF CHICAGO PRESS, Manual of Style (1910) 


125 


struts 

outputpenalty 

holdinginserts 

Struts 

showlists 

output 

insertpenalties 

maxdepth 

output 

box255 

held-over insertion 

CHICAGO 

KNUTH 


Don't use footnotes in your books, Don. 

— JILL KNUTH (1962) 
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TgX is designed to handle complex mathematical expressions in such a way that 
most of them are easy to input. The basic idea is that a complicated formula 
is composed of less complicated formulas put together in a simple way; the less 
complicated formulas are, in turn, made up of simple combinations of formulas 
that are even less complicated; and so on. Stating this another way, if you know 
how to type simple formulas and how to combine formulas into larger ones, you 
will be able to handle virtually any formula at all. So let’s start with simple ones 
and work our way up. 

The simplest formula is a single letter, like ‘x’, or a single number, like 
‘2’. In order to put these into a T 13 X text, you type ‘$x$’ and ‘$2$’, respectively. 
Notice that all mathematical formulas are enclosed in special math brackets; we 
are using $ as the math bracket in this manual, in accord with the plain TfgX 
format defined in Appendix B, because mathematics is supposedly expensive. 

When you type ‘$x$’ the ‘x’ comes out in italics, but when you type 
‘$2$’ the ‘2’ comes out in roman type. In general, all characters on your key- 
board have a special interpretation in math formulas, according to the normal 
conventions of mathematics printing: Letters now denote italic letters, while 
digits and punctuation denote roman digits and punctuation; a hyphen (-) now 
denotes a minus sign (— ), which is almost the same as an em-dash but not quite 
(see Chapter 2). The first $ that you type puts you into “math mode” and the 
second takes you out (see Chapter 13). So if you forget one $ or type one $ too 
many, TpX will probably become thoroughly confused and you will probably get 
some sort of error message. 

Formulas that have been typeset by a printer who is unaccustomed to 
mathematics usually look quite strange to a mathematician, because a novice 
printer usually gets the spacing all wrong. In order to alleviate this problem, T^X 
does most of its own spacing in math formulas; and it ignores any spaces that you 
yourself put between $’s. For example, if you type '$ x$’ and 2 they will 
mean the same thing as ‘$x$’ and ‘$ 2 $’. You can type ‘$(x + y)/(x - y)$’ or 
‘$(x+y) / (x-y)$’, but both will result in ‘(x + y)/(x — I/)', a formula in which 
there is a bit of extra space surrounding the + and — signs but none around 
the / sign. Thus, you do not have to memorize the complicated rules of math 
spacing, and you are free to use blank spaces in any way you like. Of course, 
spaces are still used in the normal way to mark the end of control sequences, 
as explained in Chapter 3. In most circumstances T^X’s spacing will be what a 
mathematician is accustomed to; but we will see in Chapter 18 that there are 
control sequences by which you can override TgX’s spacing rules if you want to. 

One of the things mathematicians like to do is make their formulas look 
like Greek to the uninitiated. In plain TgX language you can type ‘$$\alpha, 
\beta, \gamma, \delta;$$’ and you will get the first four Greek letters 

a,0,j,6‘ } 

furthermore there are uppercase Greek letters like T’, which you can get by 
typing ‘$\Gamma$’. Don’t feel intimidated if you aren’t already familiar with 
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Greek letters; they will be easy to learn if you need them. The only difficulty 
is that some symbols that look nearly the same must be carefully distinguished. 
For example, the Greek letters \nu (n) and \kappa (k) should not be confused 
with the italic letters v and x ; the Greek \phi (<p) is different from the slashed 
zero called \emptyset (0). A lowercase epsilon (e) is quite different from the 
symbol used to denote membership in a set (g); type ‘$\epsilon$’ for e and 
‘$\in$’ for G. Some of the lowercase Greek letters have variant forms in plain 
TgX’s math italic fonts: ‘$(\phi,\'theta,\epsilon,\rho)$’ yields p)’ 

while ‘$(\varphi,\vartheta,\varepsilon,\varrho)$’ yields ‘(<p, $,£, p)’. 

Besides Greek letters, there are a lot of funny symbols like ‘rj’ (which 
you get by typing ‘$\approx$’) and ‘n-’ (which you get by typing ‘$\mapsto$’). 
A complete list of these control sequences and the characters they correspond to 
appears in Appendix F. Such control sequences are allowed only in math mode, 
i.e., between $’s, because the corresponding symbols appear in the math fonts. 

► EXERCISE 16.1 

What should you type to get the formula ‘7 + v G r’ ? 

► EXERCISE 16.2 

Look at Appendix F to discover the control sequences for ‘<’, “>’, and ‘yf’. 
(These are probably the three most commonly used math symbols that are not 
present on your keyboard.) What does plain T^X call them? 

Now let’s see how the more complex formulas get built up from simple 
ones. In the first place, you can get superscripts (uphlgh * and subscripts (down low) 
by using and as shown in the following examples: 
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indices, see subscripts 
superiors, see superscripts 
inferiors, see subscripts 


Input Output 

$x~2$ x' 2 

$x_2$ X-2 

$2~x$ 2® 

$x~2y~2$ x 2 y 2 

$x ~ 2y ~ 2$ x 2 y 2 

$x_2y_2$ x- 2 V 2 

$_2F_3$ 2 F 3 


Notice that ~ and _ apply only to the next single character. If you want several 
things to be superscripted or subscripted, just enclose them in braces: 


$x~{2y}$ 

x 2y 

$2~{2~x}$ 

I 2 * 

$2~{2~{2~x}}$ 

2 2 

$y_{x_2}$ 

y. t'2 

$y_{x~2}$ 

Vx* 
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The braces in these examples have been used to specify “subformulas,” i.e., 
simpler parts of a larger formula. Tj^X makes a box for each subformula, and 
treats that box as if it were a single symbol. Braces also serve their usual purpose 
of grouping, as discussed in Chapter 5. 

It is illegal to type ‘x~y~z’ or ‘x_y_z’; T^X will complain of a “double 
superscript” or “double subscript.” You must type ‘x~{y~z}’ or ‘x~{yz}’ or 
‘x_{y_z}’ or ‘x_{yz}’ in order to make your intention clear. 

A superscript or subscript following a character applies to that character 
only; but when following a subformula it applies to that whole subformula, and 
it will be raised or lowered accordingly. For example, 

$ ( (x~2) ~3) ~4$ ((x 2 ) 3 ) 4 

${ ({ (x~2) }~3) }~4$ ((x 2 ) 3 ) 4 


In the first formula the ‘“3’ and ‘"4’ are superscripts on the right parentheses, 
i.e., on the ‘)’ characters that immediately precede them, but in the second 
formula they are superscripts on the subformulas that are enclosed in braces. 
The first alternative is preferable, because it is much easier to type and it is just 
as easy to read. 

f A subscript or superscript following nothing (as in the ‘_2F_3’ example on 
the preceding page, where the ‘_2’ follows nothing) is taken to mean a sub- 
script or superscript of an empty subformula. Such notations are (fortunately) rare in 
mathematics; but if you do encounter them it is better to make your intention clear by 
showing the empty subformula explicitly with braces. In other words, the best way to 
get S> ' in a formula is to type 1 {}_2F_3’ or ‘{_2}F_3’ or ‘{_2F_3}’. 


►EXERCISE 16.3 

J_ What difference, if any, is there between the output of ‘$x + _2F_3$’ and the 
output of ‘$x + {}_2F_3$’ ? 



► EXERCISE 16.4 

Describe the differences between the outputs of ‘${x~y}"z$’ and ‘$x"{y“z}$’. 


You can have simultaneous subscripts and superscripts, and you can 


specify them in any order: 


$x~2_3$ 

x 3 

$x_3~2$ 

H 

$x~{31415}_{92}+\pi$ 

~31415 

x 92 

$x_{y~a_b}~{z_c~d}$ 

z d c 

x ya 


Notice that simultaneous subscripts are positioned over each other. However, a 
subscript will be “tucked in” slightly when it follows certain letters; for example, 
•$P_2~2$’ produces ‘P 2 2 ’- If f° r some reason you want the left edges of both 
subscript and superscript to be aligned, you can fool TgX by inserting a null 
subformula: ‘$P{}_2~2$’ produces ‘P 2 ’. 


subformulas 
right parentheses 
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The control sequence \prime stands for the symbol which is used 
mostly in superscripts. In fact, */’ is so big as it stands that you would never 
want to use it except in a subscript or superscript, where it occurs in a smaller 
size. Here are some typical examples: 


Input 

Output 

$y_l~\prime$ 

y[ 

$y_2~{\prime\prime}$ 

vi 

$y_3~{\prime\prime\prime}$ 

y'i 1 


Since single and double primes occur rather frequently, plain T^X provides a 
convenient abbreviation: You can simply type ’ instead of ~\prime, and ’ ’ 
instead of "{\prime\prime}, and so on. 


$f ’ [g(x)]g’ (x)$ 
$y_l’+y_2’ ’$ 

$y ’ _l+y ’ ’ _2$ 

$y’ ’ ’ _3+g’ ~2$ 


f'[g(x)\g'(x) 
y'i + 1/2 
y'i + 1/2 

y'i' + g ' 2 


►EXERCISE 16.5 

JL Why do you think treats \prime as a large symbol that appears only in 
superscripts, instead of making it a smaller symbol that has already been shifted up 
into the superscript position? 


►EXERCISE 16.6 

JL Mathematicians sometimes use “tensor notation” in which subscripts and su- 
perscripts are staggered, as in l RT k i\ Explain how to achieve such an effect. 


Another way to get complex formulas from simple ones is to use the con- 
trol sequences \sqrt, \underline, or \overline. Like ~ and _, these operations 
apply to the character or subformula that follows them: 


$\sqrt2$ 

n/2 

$\sqrt{x+2}$ 

sjx + 2 

$\underline4$ 

4 

$\overline{x+y}$ 

x + y 

$\overline x+\overline y$ 

x + y 

$x~{\underline n}$ 

X— 

$x~{\overline{m+n}}$ 

x m+n 

$\sqrt{x~3+\sqrt\alpha}$ 

\J X 3 + y/a 


You can also get cube roots l \f~' and similar things by using \root: 


^/2 


prime 

tensor notation 
sqrt 

underline 

overline 

surds, see sqrt 

vinculum, see overline 

root 


$\root 3 \of 2$ 

$\root n \of {x~n+y~n}$ 
$\root n+1 \of a$ 


+i. 
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f Tlie \sqrt and \underline and \overline operations are able to place lines 
above or below subformulas of any size or shape; the bar lines change their size 
and position, so that they are long enough to cover the subformula, and high enough 
or low enough not to bump into it. For example, consider ‘\overline 1’ (I) versus 
‘\overline m’ (fn): the first has a shorter bar line, and this line has been raised higher 
than the bar in the second. Similarly, the bar in ‘\underline y’ (y) is lower than 
the bar in ‘\underline x’ (x); and square root signs appear in a variety of positions 
based on the height and depth of what is being Xsqrt’d: C a + Vd. + y . TgX knows 
the height, depth, and width of every letter and every subformula, because it considers 
them to be boxes, as explained in Chapter 11. If you have a formula in which there 
is only one \sqrt, or only one \overline or \underline, the normal positioning rules 
work fine; but sometimes you want to have uniformity between different members of a 
complex formula. For example, you might want to typeset i -\fa + \fd + \fy\ putting 
all square roots in the same vertical position. There’s an easy way to do this, using the 
control sequence \mathstrut as follows: 

$\sqrt{\mathstrut a}+\sqrt{\mathstrut d}+\sqrt{\mathstrut y}$. 

A \mathstrut is an invisible box whose width is zero; its height and depth are the 
height and depth of a parenthesis ‘(’. Therefore subformulas that contain \mathstrut 
will always have the same height and depth, unless they involve more complicated 
constructions like subscripts and superscripts. Chapter 18 discusses more powerful 
operations called \smash and \phantom by which you can obtain complete control over 
the positioning of roots and similar signs. 

► EXERCISE 16.7 

Test your understanding of what you have read so far in this chapter by explain- 
ing what should be typed to get the following formulas. (Be sure to check your 
answer with Appendix A to confirm that you’re right.) 

10 10 2 n+1 (n + l) 2 a/1 - -t 2 wTl pi 1 a bcd | /h”{ax) 

► EXERCISE 16.8 

What mistake did B. C. Dull discover after he typed the following? 

If$ x = y$ , then $x$ is equal to $y.$ 

► EXERCISE 16.9 

Explain how to type the following sentence: 

Deleting an element from an ?r-tuple leaves an (n — l)-tuple. 

► EXERCISE 16.10 

List all the italic letters that descend below the baseline. (These are the letters 
for which \underline will lower its bar line.) 

We have discussed the fact that the characters you type have special 
meanings in math mode, but the examples so far are incomplete; they don’t 
reveal all the power that is at your fingertips just after you press the ‘$’ key. It’s 
time now to go back to basics: Let us make a systematic survey of what each 
character does, when it is used in a formula. 


mathstrut 

smash 

phantom 

Dull 
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The 52 letters (A to Z and a to z) denote italic symbols (.4 to Z and 
a to z), which a mathematician would call “variables.” TpX just calls them 
“ordinary symbols,” because they make up the bulk of math formulas. There 
are two variants of lowercase L in plain TgX, namely T (which you get by simply 
typing T’) and L V (which you get by typing ‘\ell’). Although mathematicians 
commonly write something that looks like T in their manuscripts, they do so 
only to distinguish it from the numeral T’. This distinguishability problem is 
not present in printed mathematics, since an italic T is quite different from a T’; 
therefore it is traditional to use T unless T has been specifically requested. 

Plain TpX also treats the 18 characters 

0123456789!?. I / ‘ 0 " 


letters 
variables 
ordinary symbols 
ell 

digits 
numerals 
vertical line 
big-0 notation 
binary operations 
plus sign 
minus sign 
asterisk 

star, see asterisk 

hyphen 

slash 


as ordinary symbols; i.e. , it doesn’t insert any extra space when these symbols 
occur next to each other or next to letters. Unlike the letters, these 18 characters 
remain in roman type when they appear in formulas. There’s nothing special for 
you to remember about them, except that the vertical line 1 1 ’ has special uses 
that we shall discuss later. Furthermore, you should be careful to distinguish 
between ‘oh’ and ‘zero’: The italic letter O is almost never used in formulas 
unless it appears just before a left parenthesis, as in ‘O(n)’; and the numeral 0 is 
almost never used just before a left parenthesis unless it is preceded by another 
digit, as in ‘10(n — 1)’. Watch for left parentheses and you’ll be OK. (Lowercase 
o’s also tend to appear only before left parentheses; type ‘x_0’ instead of ‘x_o’, 
since the formula ‘.To’ is generally more correct than ‘x 0 ’.) 

The three characters +, -, and * are called “binary operations,” because 
they operate on two parts of a formula. For example, + is a plus sign, which is 
used for the sum of two numbers; - is a minus sign. The asterisk (*) is rarer in 
mathematics, but it also behaves as a binary operation. Here are some exam- 
ples of how TgX typesets binary operations when they appear next to ordinary 
symbols: 


Input 

$x+y-z$ 

$x+y*z$ 

$x*y/z$ 


Output 
x + y — z 
x + y * z 
x * yfz 


Notice that - and * produce quite different math symbols from what you get in 
normal text: The hyphen (-) becomes a minus sign (— ), and the raised aster- 
isk (*) drops down to a lower level (*). 

f Tf/X does not treat / as a binary operation, even though a slash stands for 
division (which qualifies as a binary operation on mathematical grounds). The 
reason is that printers traditionally put extra space around the symbols +, — , and *, 
but not around /. If TgX were to typeset / as a binary operation, the formula ‘$1/2$’ 
would come out T / 2’, which is wrong; so TLX considers / to be an ordinary symbol. 
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Appendix F lists many more binary operations, for which you type control 
sequences instead of single characters. Here are some examples: 


$x\times y\cdot z$ 
$x\circ yXbullet z$ 
$x\cup y\cap z$ 
$x\sqcup yXsqcap z$ 
$x\vee yXwedge z$ 
$x\pm y\mp z$ 


x x y ■ z 
x o y • z 
illt/flz 
x U y n z 
x V y A a 
x ± y =F s 


It is important to distinguish x (Xtimes) from A' (X) and from x (x); to distinguish U 
(Xcup) from U (U) and from u (u); to distinguish V (Xvee) from V (V) and from v (v); 
to distinguish o (\circ) from O (0) and from o (o). The symbols ‘V’ and ‘A’ can also 
be called \lor and \land, since they frequently stand for binary operations that are 
called “logical or" and “logical and." 


f Incidentally, binary operations are treated as ordinary symbols if they don’t 
occur between two quantities that they can operate on. For example, no extra 
space is inserted next to the +, — , and * in cases like the following: 


$x=+l$ 


x = +1 


$3,142-$ 3.142- 

$(D*)$ (£>*) 


Consider also the following examples, which show that binary operations can be used 
as ordinary symbols in superscripts and subscripts: 



$K_n~ + ,K_n“-$ K+,K~ 

$z“*_{i j}$ z*j 

$g~\circ \mapsto g“\bullet$ g° i— )• g * 

$f-*(x) Xcap f.*(y)$ /* (x) l~l /« (y) 

► EXERCISE 16.11 

How would you obtain the formulas ‘s“ 2 ’ and ‘hi(s)’? 


Plain TgX treats the four characters =, <, >, and : as “relations” because 
they express a relationship between two quantities. For example, l x < y’ means 
that x is less than y. Such relationships have a rather different meaning from 
binary operations like +, and the symbols are typeset somewhat differently: 


$x=y>z$ 

$x:=y$ 

$x\le y\ne z$ 

$x\sim yXsimeq z$ 
$x\equiv y\not\equiv z$ 
$x\subset yXsubseteq z$ 


x = y > z 
x := y 
x <y ^ z 
x ~ y ~ z 
x=y ^ z 
x C y C z 


(The last several examples show some of the many other relational symbols that 
plain TgX makes available via control sequences; see Appendix F.) 
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times 

cup 

vee 

circ 

cdot 

bullet 

cap 

sqcup 

sqcap 

wedge 

cross, see dagger, times 

pm 

mp 

lor 

land 

logical or 
logical and 
relations 
le 
ne 

simeq 

colon 

equals 

lessthan 

greaterthan 

colonequals 

equiv 

not 

subset 

subseteq 

sim 

hooks, see subset, supset 
wiggle, see sim 
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The two characters V (comma) and (semicolon) are treated as 
punctuation marks in formulas; this means that TgX puts a little extra space 
after them, but not before them. 

$f(x,y;z)$ f(x,y;z ) 

It isn’t customary to put extra space after a ‘ . ’ (period) in math formulas, so 
TgX treats a period as an ordinary symbol. If you want the 1 : ’ character to be 
treated as a punctuation mark instead of as a relation, just call it \colon: 


$f :A\to B$ / : A ->• B 

$f\colon A\to B$ /: A — )• 13 

If you want to use a comma as an ordinary symbol (e.g., when it appears in a 
large number), just put it in braces; TgX treats anything in braces as an ordinary 
symbol. For instance, 



$12,345x$ 12,345a; (wrong) 

$12{,}345x$ 12,345a; (right) 

► EXERCISE 16.12 

What’S an easy way to get a raised dot in a decimal constant (e.g., ‘3-1416’)? 


So far we have considered letters, other ordinary symbols, binary oper- 
ations, relations, and punctuation marks; hence we have covered almost every 
key on the typewriter. There are just a few more: The characters *'(’ and 1 [’ are 
called “openings,” while “)’ and ‘] ’ are called “closings”; these act pretty much 
like ordinary symbols, but they help TgX to decide when a binary operation is 
not really being used in a binary way. Then there is the character ’ , which we 
know is used as an abbreviation for \prime superscripts. Finally, we know that 
plain TgX reserves the other ten characters: 




These are not usable for symbols in math mode unless their \ cat code values 
are changed (see Chapter 7). Although { and } specify grouping, the control 
sequences and ! \}’ can be used to get as an opening and ‘}’ as a closing. 

/gK/gK All of these math mode interpretations are easily changeable, since each char- 
JL JL acter has a \mathcode, as explained in Chapter 17; none of the conventions are 
permanently built into TjrX. However, most of them are so standard that it is usually 
unwise to make many changes, except perhaps in the interpretations of and @. 

The special characters ~ and _ that designate superscripts and subscripts 
should not be used except in formulas. Similarly, the names of math symbols 
like \alpha and \approx, and the control sequences for math operations like 
\overline, must not invade ordinary text. TgX uses these facts to detect missing 
dollar signs in your input, before such mistakes cause too much trouble. For 
example, suppose you were to type 


comma 

semicolon 

punctuation marks in formulas 

period 

colon 

to 

lparen 

lbracket 

openings 

rparen 

fences, see opening, closing, delin 

rbracket 

closings 

catcode 

lbrace 

rbrace 

mathcode 

at sign 

leftquote 

doublequote 

circumflex 

underbar 

missing dollar signs 


The smallest $n such that $2~n>1000$ is~10. 
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TgX doesn’t know that you forgot a ‘$’ after the first ‘n’, because it doesn’t 
understand English; so it finds a “formula” between the first two $ signs: 

The smallest nsuchthat 


after which it thinks that ‘2’ is part of the text. But then the ~ reveals an 
inconsistency; T^X will automatically insert a $ before the ", and you will get 
an error message. In this way the computer has gotten back into synch, and the 
rest of the document can be typeset as if nothing had happened. 



Conversely, a blank line or \par is not permitted in math mode. This gives 
Tp;X another way to recover from a missing $; such errors will be confined to 


the paragraph in which they occur. 


f lf for some reason you cannot use * and _ for superscripts and subscripts, 
because you have an unusual keyboard or because you need “ for French 
accents or something, plain TfrX lets you type \sp and \sb instead. For example, 
l $x\sp2$’ is another way to get ‘x 2 \ On the other hand, some people are lucky enough 
to have keyboards that contain additional symbols besides those of standard ASCII. 
When such symbols are available, TgX can be set up to make math typing a bit more 
pleasant. For example, at the author’s installation there are keys labeled t and 4- that 
produce visible symbols (these make superscripts and subscripts look much nicer on 
the screen); there are keys for the relations <, >, and # (these save time); and there are 
about two dozen more keys that occasionally come in handy. (See Appendix C.) 


f Mathematicians are fond of using accents over letters, because this is often 
an effective way to indicate relationships between mathematical objects, and 
because it greatly extends the number of available symbols without increasing the 
number of necessary fonts. Chapter 9 discusses the use of accents in ordinary text, but 
mathematical accents are somewhat different, because spacing is not the same; TgX 
uses special conventions for accents in formulas, so that the two sorts of accents will not 
be confused with each other. The following math accents are provided by plain TfjX: 


$\hat a$ a 
$\check a$ a 
$\tilde a$ a 
$\acute a$ a 
$\grave a$ a 
$\dot a$ a 
$\ddot a$ a 
$\breve a$ a 
$\bar a$ a 
$\vec a$ a 


The first nine of these are called \~, \v, \~, V, V, \., V, \u, and \=, respectively, 
when they appear in text; \vec is an accent that appears only in formulas. TEX will 
complain if you try to use \* or \v, etc., in formulas, or if you try to use \hat or 
\check, etc., in ordinary text. 


par 

sp 

sb 

character set 

uparrow 

downarrow 

leq 

geq 

neq 

accents 

hat 

check 

tilde 

acute 

grave 

dot 

ddot 

breve 

bar 

vec 
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f it’s usually a good idea to define special control sequences for accented letters 
that you need frequently. For example, you can put 

\def\Ahat{{\hat A}} 

\def\chat{{\hat c}} 

\def\scheck{{\check s}} 

\def\xtilde{{\tilde x}} 

\def\zbar{{\bar z}} 

at the beginning of a manuscript that uses the symbols A, c, s, x, and Z more than, 
say, five times. This saves you a lot of keystrokes, and it makes the manuscript easier 
to read. Chapter 20 explains how to define control sequences. 

f When the letters i and j are accented in math formulas, dotless symbols i 
and j should be used under the accents. These symbols are called \imath and 
\jmath in plain TpX. Thus, for example, a paper that uses T and ‘j’ ought to begin 
with the following definitions: 

\def\ihat{{\hat\imath}} 

\def\ j hat {{\hat\j math}} 

f You can put accents on top of accents, making symbols like A that might cause 
a mathematician to squeal with ecstasy. However, it takes a bit of finesse to 
get the upper accent into a position that looks right, because the designer of a font 
for mathematics usually tells TpX to position math accents in special ways for special 
letters. Plain Tj'X provides a control sequence called \skew that makes it fairly easy to 
shift superaccents into their proper place. For example, ‘\skew6\hat\Ahat’ was used 
to produce the symbol above. The number ‘6’ in this example was chosen by trial and 
error; ‘5’ seems to put the upper accent a bit too far left, while ‘7’ makes it a bit too 
far right, at least in the author’s opinion. The idea is to fiddle with the amount of skew 
until you find what pleases you best. 

f it’s possible, in fact, to put math accents on any subformula, not just on 
single characters or accented characters. But there’s usually not much point 
in doing so, because TjrjX just centers the accent over the whole subformula. For 
example, ‘$\hat{I+M}$’ yields ‘I 4- M ’. In particular, a \bar accent always stays the 
same size; it’s not like \overline, which grows with the formula under it. Some people 
prefer the longer line from \overline even when it applies to only a single letter; for 
example, ‘$\bar z+\overline z$’ produces ‘Z + «’, and you can take your pick when 
you define \zbar. However, plain Tf^X does provide two accents that grow; they are 
called \widehat and \widetilde: 

$\widehat x,\widetilde x$ x, X 

$\widehat{xy} ,\widetilde{xy}$ xy, xy 

$\widehat{xyz} , \widetilde{xyz}$ xyz, xyz 

The third example here shows the maximum size available. 


► EXERCISE 16.13 

This has been another long chapter; but cheer up, you have learned a lot! Prove 
it by explaining what to type in order to get the formulas e~ x ~ , D ~ p a M + l , 


def 

dotless 

imath 

jmath 

accents on top of accents 

skew 

overline 

widehat 

widetilde 
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and g £ (H n i 1 )'. (In the last example, assume that a control sequence \ghat 
has already been defined, so that \ghat produces the accented letter g.) 


Greek 

LAMPORT 

IfTfcX 

SPIVAK 


Producing Greek letters is as easy as w. 
You just type ... as easy as $\pi$. 

— LESLIE LAMPORT, The LpT E X Document Preparation System (1983) 

T E X has no regard for the glories of the Greek tongue — 
as far as it is concerned, Greek letters are just additional weird symbols, 

and they are allowed only in math mode. 
In a pinch you can get the output re\ by typing $\tau\epsilon\chi$, 
but if you're actually setting Greek text, you will be using 
a different version of T E X, designed for a keyboard with Greek letters on it, 

and you shouldn't even be reading this manual, 
which is undoubtedly all English to you. 
— MICHAEL SPIVAK, The Joy Of T E X (1982) 
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Another thing mathematicians like to do is make fractions — and they like to 
build symbols up on top of each other in a variety of different ways: 

1 and and (" + ^ and W. 

2 3 \ 3 J n 

' n = 1 

You can get these four formulas as displayed equations by typing ‘$$l\over2$$’ 
and ‘$$n+l\over3$$’ and ‘$$n+l\choose3$$’ and ‘$$\sum_{n=l}~3 Z_n~2$$’; 
we shall study the simple rules for such constructions in this chapter. 

First let’s look at fractions, which use the ‘\over’ notation. The control 
sequence \over applies to everything in the formula unless you use braces to 
enclose it in a specific subformula; in the latter case, \over applies to everything 
in that subformula. 


Input 

$$x+y~2\over k+l$$ 

$${x+y~2\over k}+l$$ 
$$x+{y~2\over k}+l$$ 
$$x+{y~2\over k+l}$$ 
$$x+y~{2\over k+l}$$ 


Output 

x + y 2 
k + 1 


x + y 


+ 1 


x + —— + 1 

k 


x + 


r 


k + 1 

2 

x + y k + 1 


You aren’t allowed to use \over twice in the same subformula; instead of typing 
something like ‘a \over b \over 2’, you must specify what goes over what: 

a_ 

$${a\over b}\over 2$$ \ 


$$a\over{b\over 2}$$ — 

2 


Unfortunately, both of these alternatives look pretty awful. Mathematicians 
tend to “overuse” \over when they first begin to typeset their own work on a 
system like Tj^X. A good typist or copy editor will convert fractions to a “slashed 
form,” whenever a built-up construction would be too small or too crowded. For 
example, the last two cases should be treated as follows: 


$$a/b \over 2$$ 


a/b 


$$a \over b/2$$ — — 

0 1 £ 

Conversion to slashed form takes a little bit of mathematical knowhow, since 
parentheses sometimes need to be inserted in order to preserve the meaning of 


sum 

choose 

fractions 

over 

stacked fractions, see over 
slashed form 
parentheses 
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the formula. Besides substituting 7’ for ‘\over’, the two parts of the fraction 
should be put in parentheses unless they are single symbols; for example, | be- 
comes simply a/6, but becomes ( a,+ 1 )/b, and fyf- becomes (a + 1 )/(b -F 1). 
Furthermore, the entire fraction should generally be enclosed in parentheses if 
it appears next to something else; for example, jx becomes ( a/b)x . If you are 
a typist without mathematical training, it’s best to ask the author of the manu- 
script for help, in doubtful cases; you might also tactfully suggest that unsightly 
fractions be avoided altogether in future manuscripts. 

► EXERCISE 17.1 

2 

What’s a better way to render the formula x + y k + 1 ? 

► EXERCISE 17.2 

Convert ‘fyyx’ to slashed form. 

► EXERCISE 17.3 

What surprise did B. L. User get when he typed ‘$$x = (y~2\over k+l)$$’? 

► EXERCISE 17.4 

How can you make (Assume that the control sequence \cents yields l f.) 

The examples above show that letters and other symbols sometimes get 
smaller when they appear in fractions, just as they get smaller when they are 
used as exponents. It’s about time that we studied TpX’s method for choosing 
the sizes of things. TpX actually has eight different styles in which it can treat 
formulas, namely 

display style (for formulas displayed on lines by themselves) 

text style (for formulas embedded in the text) 

script style (for formulas used as superscripts or subscripts) 

scriptscript style (for second-order superscripts or subscripts) 

and four other “cramped” styles that are almost the same except that exponents 
aren’t raised quite so much. For brevity we shall refer to the eight styles as 

D , D 1 , T, T\ S, S', 55, 55', 

where D is display style, D' is cramped display style, T is text style, etc. TpX 
also uses three different sizes of type for mathematics; they are called text size, 
script size, and scriptscript size. 

The normal way to typeset a formula with TgX is to enclose it in dollar 
signs $...$; this yields the formula in text style (style T). Or you can enclose it in 
double dollar signs $$ . . . $$; this displays the formula in display style (style D). 
The subformulas of a formula might, of course, be in different styles. Once you 
know the style, you can determine the size of type that TpX will use: 

If a letter is in style then it will be set in 
D,D',T,T' text size (like this) 

5, S' script size (like this) 

55, 55' scriptscript size 


User 

money 

cents 

styles 

display style 
text style 
script style 
scriptscript style 
cramped 

sizes of type for mathematics 
text size 
script size 
scriptscript size 


(like this) 
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There is no “ SSS ” style or “scriptscriptscript” size; such tiny symbols would 
be even less readable than the scriptscript ones. Therefore TgX stays with 
scriptscript size as the minimum: 


In a formula 
of style 
D,T 
D',T' 

s,ss 

S',SS' 


the superscript 
style is 
S 
S' 

SS 

ss' 


and the subscript 
style is 
S' 

S' 

SS' 

SS' 


For example, if x~{a_b} is to be typeset in style D, then a_b will be set in 
style S , and b in style SS 1 ; the result is l x ab \ 

So far we haven’t seen any difference between styles D and T. Actually 
there is a slight difference in the positioning of exponents, although script size 
is used in each case: You get x 2 in D style and x 2 in T style and x' 2 in D' or 
T' style — do you see the difference? But there is a big distinction between D 
style and T style when it comes to fractions: 


the style of the 
numerator a is 
T 
T' 

S 

S' 

SS 

SS' 


and the style of the 
denominator J? is 
T' 

V 

S' 

S' 

ss' 

ss' 


In a formula 
a:\over ft of style 
D 
D' 

T 
T' 

S,SS 
S',SS' 

Thus if you type ‘$l\over2$’ (in a text) you get 
but if you type •$$l\over2$$’ you get 

1 

2 

(a displayed formula), which is style T over style T' . 

f While we’re at it, we might as well finish the style rules: \underline does 
not change the style. Math accents, and the operations \sqrt and \overline, 
change uncramped styles to their cramped counterparts; for example, D changes to D ' , 
but D' stays as it was. 


2 > 


namely style S over style S'; 


►EXERCISE 17.5 

J_ State the style and size of each part of the formula 
formula itself is in style D. 


! P2 , assuming that the 


Suppose you don’t like the style that T^X selects by its automatic style 
rules. Then you can specify the style you want by typing \displaystyle or 
\textstyle or \scriptstyle or \scriptscriptstyle; the style that you se- 
lect will apply until the end of the formula or subformula, or until you select 


numerator 
denominator 
underline 
Math accents 
sqrt 

overline 

displaystyle 

textstyle 

scriptstyle 

scriptscriptstyle 
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another style. For example, ‘$$n+\scriptstyle n+\scriptscriptstyle n.$$’ 
produces the display 

71 +n+n. 

This is a rather silly example, but it does show that the plus signs get smaller 
too, as the style changes. T^X puts no space around + signs in script styles. 

Here’s a more useful example of style changes: Sometimes you need to 
typeset a “continued fraction” made up of many other fractions, all of which are 
supposed to be in display style: 

1 

«o H 

ai H 

a 2 H — 

0'3 d 

a. 4 

In order to get this effect, the idea is to type 

$$a_0+{l\over\displaystyle a_l+ 

{\strut l\over\displaystyle a_2+ 

{\strut l\over\displaystyle a_3+ 

{\strut l\over a_4}}}}$$ 


(The control sequence \strut has been used to make the denominators taller; 
this is a refinement that will be discussed in Chapter 18. Our concern now is with 
the style commands.) Without the appearances of \strut and \displaystyle 
in this formula, the result would be completely different: 


ao + 


1 




These examples show that the numerator and denominator of a fraction are 
generally centered with respect to each other. If you prefer to have the nu- 


merator or denominator appear flush left, put ‘\hfilT after it; or if you prefer flush 


right, put ‘\hfilT at the left. For example, if the first three appearances of T\over’ 


in the previous example are replaced by T\hf ill\over’, you get the display 


«o d- 


1 

1 

a i H 

1 

a 2 d 

1 

«3 H 

CL 4 


continued fraction 

strut 

flush left 

hfill 

flush right 


(a format for continued fractions that many authors prefer). This works because \hf ill 
stretches at a faster rate than the glue that is actually used internally by Tl^X when it 
centers the numerators and denominators. 
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TgjX has another operation ‘\atop’, which is like \over except that it 
leaves out the fraction line: 

X 

$$x\atop y+2$$ 

y + 2 

The plain TgX format in Appendix B also defines ‘\choose’, which is like \atop 
but it encloses the result in parentheses: 

$$n\choose k$$ 



It is called \choose because it’s a common notation for the so-called binomial 
coefficient that tells how many ways there are to choose k things out of n things. 

You can’t mix \over and \atop and \choose with each other. For 
example, ! $$n \choose k \over 2$$’ is illegal; you must use grouping, to get 
either ‘$${n\choose k}\over2$$’ or ‘$$n\choose{k\over2}$$’, i.e., 



The latter formula, incidentally, would look better as ‘$$n\choose k/2$$’ or 
‘$$n\choose{l\over2}k$$’, yielding 



► EXERCISE 17.6 

( n ) 

As alternatives to discuss how you could obtain the two displays 



atop 

choose 

binomial coefficient 
above 

compound fraction 


► EXERCISE 17.7 

Explain how to specify the displayed formula 



x 2 y p 2 


1 1 

1 — x 1 — x' 2 


f T^X has a generalized version of \over and \atop in which you specify the 
exact thickness of the line rule by typing ‘\above{dimen)\ For example, 

$$\displaystyle{a\over b}\abovelpt\displaystyle{c\over d}$$ 

will produce a compound fraction with a heavier (lpt thick) rule as its main bar: 


b_ 

c 

d. 


This sort of thing occurs primarily in textbooks on elementary mathematics. 
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Mathematicians often use the sign y] to stand for “summation” and the 
sign f to stand for “integration.” If you’re a typist but not a mathematician, 
all you need to remember is that \sum stands for and \int for f ; these 
abbreviations appear in Appendix F together with all the other symbols, in case 
you forget. Symbols like and f (and a few others like |J and f[ and j> and 0, 
all listed in Appendix F) are called large operators, and you type them just as 
you type ordinary symbols or letters. The difference is that TpX will choose a 
larger large operator in display style than it will in text style. For example, 


$\sum x_n$ yields ./•„ (T style) 

$$\sum x_n$$ yields 5> (D style). 

A displayed \sum usually occurs with “limits,” i.e. , with subformulas 
that are to appear above and below it. You type limits just as if they were 
superscripts and subscripts; for example, if you want 

m 

E 

n= 1 


you type either ‘$$\sum_{n=l}"m$$’ or ‘$$\sum~m_{n=l}$$’. According to the 
normal conventions of mathematical typesetting, TpX will change this to ‘X)™ = i’ 
(i.e., without limits) if it occurs in text style rather than in display style. 

Integrations are slightly different from summations, in that the super- 
scripts and subscripts are not set as limits even in display style: 

$\int_{-\infty}~{+\infty}$ yields (T style) 

/ +oo 

(D style). 

-OO 


f Some printers prefer to set limits above and below J signs; this takes more 
space on the page, but it gives a better appearance if the subformulas are 
complex, because it keeps them out of the way of the rest of the formula. Similarly, 
limits are occasionally desirable in text style or script style; but some printers prefer 
not to set limits on displayed ^ signs. You can change IjgX’s convention by simply 
typing ‘\limits’ or ‘\nolimits 1 immediately after the large operator. For example, 


summation 

integration 

sum 

int 

large operators 

collective signs, see large operato 

sigma signs, see sum 

limits 

limits 

nolimits 

displaylimits 


$$\int\limits_0~{\pi\over2}$$ 


$$\sum\nolimits_{n=l}~m$$ 


yields 


yields 


2 

/ 



/>)/>) If you say ‘\nolimits\limits’ (presumably because some macro like \int 
J_ J_ specifies \nolimits, but you do want them), the last word takes precedence. 
There’s also a command ‘\displaylimits’ that can be used to restore TjrX’s normal 
conventions; i.e., the limits will be displayed only in styles D and D' . 
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Sometimes you need to put. two or more rows of limits under a large operator; 
you can do this with ‘\atop’. For example, if you want the displayed formula 

E 

0< i< m 
0 <j<n 

the correct way to type it is 



$$\sum_{\scriptstyleO\le i\le m\atop\scriptstyleO<j<n}P(i, j)$$ 

(perhaps with a few more spaces to make it look nicer in the manuscript file). The 
instruction ‘\scriptstyle’ was necessary here, twice — otherwise the lines ‘0 < i < m’ 
and ‘0 < j < n' would have been in scriptscript size, which is too small. This is another 
instance of a rare case where T^X’s automatic style rules need to be overruled. 


► EXERCISE 17.8 

P Q r 

How would you type the displayed formula EEE Q'ijbjkCki ? 

i = 1 j = 1 k=l 



► EXERCISE 17.9 

And how would you handle ^ ' aijbjkCki ? 

l<i<p 
l<j<q 
1 <fc<r 


Since mathematical formulas can get horribly large, TgX has to have 
some way to make ever larger symbols. For example, if you type 

$$\sqrt{l+\sqrt{l+\sqrt{l+ 

\sqrt{l+\sqrt{l+\sqrt{l+\sqrt{l+x}}}}}}}$$ 


the result shows a variety of available square-root signs: 


atop 

scriptstyle 
square-root signs 
delimiter 

fences, see delimiters 

parentheses 

braces 

pieces of symbols 


l + Y i + \J i + 

The three largest signs here are all essentially the same, except for a vertical 
segment 1 I ’ that gets repeated as often as necessary to reach the desired size; 
but the smaller signs are distinct characters found in Tp]X’s math fonts. 

A similar thing happens with parentheses and other so-called “delimiter” 
symbols. For example, here are some of the different sizes of parentheses and 
braces that plain T^X might use in formulas: 



The three largest pairs in each case are made with repeatable extensions, so they 
can become as large as necessary. 
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Delimiters are important to mathematicians, because they provide good 
visual clues to the underlying structure of complex expressions; they delimit the 
boundaries of individual subformulas. Here is a list of the 22 basic delimiters 
provided by plain TgX: 


Input 

Delimiter 

( 

left parenthesis: ( 

) 

right parenthesis: ) 

[ or \ lbrack 

left bracket: [ 

] or \rbrack 

right bracket: ] 

\{ or \lbrace 

left curly brace: { 

\} or \rbrace 

right curly brace: } 

\lf loor 

left floor bracket: [ 

\rf loor 

right floor bracket: J 

\lceil 

left ceiling bracket: [ 

\rceil 

right ceiling bracket: ] 

\ langle 

left angle bracket: ( 

\rangle 

right angle bracket: ) 

/ 

slash: / 

\backslash 

reverse slash: \ 

I or \vert 

vertical bar: 

\ I or \Vert 

double vertical bar: || 

\uparrow 

upward arrow: f 

\Uparrow 

double upward arrow: fl 

\downarrow 

downward arrow: J, 

\Downarrow 

double downward arrow: ■[[ 

\updownarrow 

up-and-down arrow: £ 

\Updownarrow 

double up-and-down arrow: ^ 


In some cases, there are two ways to get the same delimiter; for example, you 
can specify a left bracket by typing either 1 [’ or ‘\lbrack’. The latter alternative 
has been provided because the symbol ‘ [’ is not readily available on all computer 
keyboards. Remember, however, that you should never try to specify a left brace 
or right brace simply by typing or ‘}’; the { and } symbols are reserved for 
grouping. The right way is to type or ‘\}’ or ‘\lbrace’ or ‘\rbrace’. 

In order to get a slightly larger version of any of these symbols, just 
precede them by ‘\bigl’ (for opening delimiters) or ‘\bigr’ (for closing ones). 
This makes it easier to read formulas that contain delimiters inside delimiters: 


lbrack 

rbrack 

lbrace 

rbrace 

lfloor 

rfloor 

lceil 

rceil 

langle 

rangle 

backslash 

vert 

Vert 

uparrow 

Uparrow 

downarrow 

Downarrow 

up downarrow 

Updownarrow 

bent bars, see langle, rangle 
curly braces, see lbrace, rbrace 
leftbracket 
rightbracket 
leftbrace 
rightbrace 
/ 

bigl 

bigr 


Input 

$\bigl (x-s (x) \bigr) \bigl (y-s (y) \bigr) $ 
$\bigl [x-s [x] \bigr] \bigl [y-s [y] \bigr] $ 
$\bigl | | x | + | y | \bigr | $ 

$\bigl\lf loor\sqrt A\bigr\rf loor$ 


Output 

{x - s{x )) ( ; y - s(y )) 
[x ~ s[x]] [y - s [-;/]] 

1 1*1 + Ml 
L^j 
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The \big delimiters are just enough bigger than ordinary ones so that the dif- 
ference can be perceived, yet small enough to be used in the text of a paragraph. 
Here are all 22 of them, in the ordinary size and in the \big size: 

0[]{}Ljno/\iiimm 

()[]{} lj noAiii mm 

You can also type \Bigl and \Bigr to get larger symbols suitable for displays: 


These are 50% taller than their \big counterparts. Displayed formulas most 
often use delimiters that are even taller (twice the size of \big); such delimiters 
are constructed by \biggl and \biggr, and they look like this: 


[]{} 


Finally, there are \Biggl and \Biggr versions, 2.5 times as tall as the \bigl 
and \bigr delimiters: 


Bigl 

Bigr 

biggl 

biggr 

Biggl 

Biggr 

partial 

varphi 

opening 

closing 

bigm 

Bigm 

biggni 

Biggm 

relation 

bigcup 

bigcap 

verticalline 

in 

big 

Big 

bigg 

Bigg 


► EXERCISE 17.10 

Guess how to type the formula 


d 2 d 2 
dx 2 dy 2 


c p(x + iy)Y = 0, in display style, 


using \bigg delimiters for the large parentheses. (The symbols d and ip that 
appear here are called \partial and \varphi.) 

►EXERCISE 17.11 

JL In practice, \big and \bigg delimiters are used much more often than \Big 
and \Bigg ones. Why do you think this is true? 

f A \bigl or \Bigl or \biggl or \Biggl delimiter is an opening, like a left 
parenthesis; a \bigr or \Bigr or \biggr or \Biggr delimiter is a closing, like 
a right parenthesis. Plain TjrX also provides \bigm and \Bigm and \biggm and \Biggm 
delimiters, for use in the middle of formulas; such a delimiter plays the role of a relation, 
like an equals sign, so TgX puts a bit of space on either side of it. 


$\bigl(x\in A(n)\bigm| x\in B(n)\bigr)$ ( x G A(n) | X G B(n)) 

$\bigcup_n X_n\bigm\ I \bigcap_n Y_n$ (J n X n II ru» 


You can also say just \big or \Big or \bigg or \Bigg; this produces a delimiter that 
acts as an ordinary variable. It is used primarily with slashes and backslashes, as in 
the following example. 

$${a+l\over b}\bigg/{c+l\over d}$$ 


a. + 1 / C + 1 

b / d 



► EXERCISE 17.12 

What’s the professional way to type ( x + f(x)) / ( x — f{x))? (Look closely.) 
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TgX has a built-in mechanism that figures out how tall a pair of delim- 
iters needs to be, in order to enclose a given subformula; so you can use this 
method, instead of deciding whether a delimiter should be \big or \bigg or 
whatever. All you do is say 

\ left (delimi ) {subformula) \r ight (delim 2 ) 

and TpX will typeset the subformula, putting the specified delimiters at the left 
and the right. The size of the delimiters will be just big enough to cover the 
subformula. For example, in the display 

$$l+\left(l\overl-x~2\right)~3$$ 1 + 

TgX has chosen \biggl( and \biggr), because smaller delimiters would be too 
small for this particular fraction. A simple formula like ‘$\left (x\right) $’ 
yields just ‘(;r)’; thus, \left and \right sometimes choose delimiters that are 
smaller than \bigl and \bigr. 

Whenever you use \left and \right they must pair up with each other, 
just as braces do in groups. You can’t have \left in one formula and \right in 
another, nor are you allowed to type things like ‘\left(. . . Aright) . . 

or ‘\left(. . . \begingroup. . Aright) . . Aendgroup’. This restriction makes 
sense, because T^X needs to typeset the subformula that appears between \left 
and \right before it can decide how big to make the delimiters. But it is 
worth explicit mention here, because you do not have to match parentheses and 
brackets, etc., when you are not using \left and \right: TppC will not complain 
if you input a formula like ‘$[0,1)$’ or even l $) ($’ or just ‘$)$’. (And it’s a 
good thing TpjX doesn’t, for such unbalanced formulas occur surprisingly often 
in mathematics papers.) Even when you do use \left and \right, TgX doesn’t 
look closely at the particular delimiters that you happen to choose; thus, you 
can type strange things like ‘\left)’ and/or Aright (’ if you know what you’re 
doing. Or even if you don’t. 

The \over operation in the example displayed above does not involve 
the Y+’ at the beginning of the formula; this happens because \left and \right 
have the function of grouping, in addition to their function of delimiter-making. 
Any definitions that you happen to make between \left and \right will be 
local, as if braces had appeared around the enclosed subformula. 



► EXERCISE 17.13 

Use \left and \right to typeset the following display (with \phi for <j>): 


n( n ) = ^ 
k=2 


m 

k- 1 


At this point you are probably wondering why you should bother learn- 
ing about \bigl and \bigr and their relatives, when \left and \right are 
there to calculate sizes for you automatically. Well, it’s true that \left and 
\right are quite handy, but there are at least three situations in which you 


left 

right 

parentheses 

brackets 

crotchets, see brackets 

grouping 

phi 
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will want to use your own wisdom when selecting the proper delimiter size: 

(1) Sometimes \left and \right choose a smaller delimiter than you want. For 

example, we used \bigl and \bigr to produce ||x| + |y|| in one of the previous 
illustrations; \left and \right don’t make things any bigger than necessary, 
so |\left |x\right |+\left |y\right |\right |$’ yields only ‘||x| + |(/||’. 

(2) Sometimes \left and \right choose a larger delimiter than you want. This 
happens most frequently when they enclose a large operator in a display; for 
example, compare the following two formulas: 


$$\left( \sum_{k=l}~n A_k \right)$$ 


$$\biggl( \sum_{k=l}~n A_k \biggr)$$ 


,5> 

\k = 1 

cJL 

y k = 1 


The rules of \left and \right cause them to enclose the \sum together with 
its limits, but in special cases like this it looks better to let the limits hang out 
a bit; \bigg delimiters are better here. (3) Sometimes you need to break a huge 
displayed formula into two or more separate lines, and you want to make sure 
that its opening and closing delimiters have the same size; but you can’t use 
\left on the first line and \right on the last, since \left and \right must 
occur in pairs. The solution is to use \Biggl (say) on the first line and \Biggr 
on the last. 

f Of course, one of the advantages of \left and \right is that they can make 
arbitrarily large delimiters — much bigger than \biggggg! The slashes and 
angle brackets do have a maximum size, however; if you ask for really big versions of 
those symbols you will get the largest ones available. 

► EXERCISE 17.14 

Prove that you have mastered delimiters: Coerce T^X into producing the formula 

-l 


7r( n ) = 


m = 2 


m — 1 

E 

k=l 


\_(m/k)/ \m/k] \ 


f lf you type after \left or \right, instead of specifying one of the basic 
delimiters, you get a so-called null delimiter (which is blank). Why on earth 
would anybody want that, you may ask. Well, you sometimes need to produce formulas 
that contain only one large delimiter. For example, the display 


f x, if 
l -x, if 


if x > 0 
—x, if x < 0 


has a but no “}’. It can be produced by a construction of the form 

$$ I x | =\left\{ ... \right.$$ 


sum 

limits 

null delimiter 


Chapter 18 explains how to fill in the ‘ . . . ’ to finish this construction; let’s just notice 
for now that the ‘\right.’ makes it possible to have an invisible right delimiter to go 
with the visible left brace. 
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A null delimiter isn’t completely void; it is an empty box whose width is a T|X 
JL JL parameter called \nulldelimiterspace. We will see later that null delimiters 
are inserted next to fractions. Plain Tj?X sets \nulldelimiterspace=l . 2pt. 

You can type ‘<’ or ‘>’ as convenient abbreviations for \langle and 
\rangle, when Tf^X is looking for a delimiter. For example, ‘\bigl<’ is equiv- 
alent to ‘\bigl\langle’, and ‘\right>’ is equivalent to ‘\right\r angle’. Of 
course ‘<’ and “>’ ordinarily produce the less-than and greater-than relations 
‘O’, which are quite different from angle brackets ‘()’. 

f Plain TjyX also makes available a few more delimiters, which were not listed 
in the basic set of 22 because they are sort of special. The control sequences 
\arrowvert, \Arrowvert, and \bracevert produce delimiters made from the repeatable 
parts of the vertical arrows, double vertical arrows, and large braces, respectively, 
without the arrowheads or the curly parts of the braces. They produce results similar 
to \vert or Wert, but they are surrounded by more white space and they have a 
different weight. You can also use \lgroup and \rgroup, which are constructed from 
braces without the middle parts; and \lmoustache and \rmoustache, which give you 
the top and bottom halves of large braces. For example, here are the \Big and \bigg 
versions of \vert, Wert, and these seven special delimiters: 


( 111 ' 

(il l 


Notice that \lgroup and \rgroup are rather like bold parentheses, with sharper bends 
at the corners; this makes them attractive for certain large displays. But you can- 
not use them exactly like parentheses, because they are available only in large sizes 
(\Big or more). 

($)&) Question: What happens if a subscript or superscript follows a large delim- 
JL JL iter? Answer: That’s a good question. After a \left delimiter, it is the first 
subscript or superscript of the enclosed subformula, so it is effectively preceded by {}. 
After a \right delimiter, it is a subscript or superscript of the entire \left. . Aright 
subformula. And after a \bigl or \bigr or \bigm or \big delimiter, it applies only to 
that particular delimiter. Thus, ‘\bigl(_2’ works quite differently from ‘\left(_2’. 

f lf you look closely at the examples of math typesetting in this chapter, you 
will notice that large parentheses and brackets are symmetric with respect to 
an invisible horizontal line that runs a little bit above the baseline; when a delimiter 
gets larger, its height and depth both grow by the same amount. This horizontal line 
is called the axis of the formula; for example, a formula in the text of the present 

paragraph would have an axis at this level: . The bar line in every fraction is 

centered on the axis, regardless of the size of the numerator or denominator. 

f Sometimes it is necessary to create a special box that should be centered 
vertically with respect to the axis. (For example, the ‘|x| = {...’ example 
above was done with such a box. ) TjrX provides a simple way to do this: You just say 


\vcenter{ (vertical mode material)} 


nulldelimiterspace 

langle 

rangle 

less-than 

greater-than 

angle brackets 

arrowvert 

Arrowvert 

bracevert 

vert 

Vert 

lgroup 

rgroup 

lmoustache 

rmoustache 

moustaches 

subscript 

superscript 

baseline 

axis 
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and the vertical mode material will be packed into a box just as if \vcenter had been 
\vbox. Then the box will be raised or lowered until its top edge is as far above the axis 
as the bottom edge is below. 

The concept of “axis” is meaningful for Tp'X only in math formulas, not in 
JL JL ordinary text; therefore TjXK allows you to use \vcenter only in math mode. 
If you really need to center something vertically in horizontal mode, the solution is 
to say ‘$\vcenter{. . . }$’. (Incidentally, the constructions ‘\vcenter to(dimen)’ and 
‘\vcenter spread(dimen)’ are legal too, in math mode; vertical glue is always set by 
the rules for \vbox in Chapter 12. But \vcenter by itself is usually sufficient.) 

f Any box can be put into a formula by simply saying \hbox or \vbox or \vtop 
or \box or \copy in the normal way, even when you are in math mode. Fur- 
thermore you can use \raise or Xlower, as if you were in horizontal mode, and you 
can insert vertical rules with \vrule. Such constructions, like \vcenter, produce boxes 
that can be used like ordinary symbols in math formulas. 

/>)/>) Sometimes you need to make up your own symbols, when you run across 
JL JL something unusual that doesn’t occur in the fonts. If the new symbol occurs 
only in one place, you can use \hbox or \vcenter or something to insert exactly what 
you want; but if you are defining a macro for general use, you may want to use different 
constructions in different styles. TjrX has a special feature called \mathchoice that 
comes to the rescue in such situations: You write 

\mathchoi ce{ (math) }{(mat.h)}{ (math) }{ (math)} 

where each (math) specifies a subformula. TjgX will choose the first subformula in style 
D or D' , the second in style T or T' , the third in style S or S' , the fourth in style SS 
or SS' . (TjjX actually typesets all four subformulas, before it chooses the final one, 
because the actual style is not always known at the time a \mathchoice is encountered; 
for example, when you type ‘\over’ you often change the style of everything that has 
occurred earlier in the formula. Therefore \mathchoice is somewhat expensive in terms 
of time and space, and you should use it only when you’re willing to pay the price.) 

(£)(£)*■ EXERCISE 17.15 

JL JL Guess what output is produced by the following commands: 

\def\puzzle{{\mathchoice{D}{T}{S}{SS}}} 

$$\puzzle{\puzzle\over\puzzle~{\puzzle"\puzzle}}$$ 

(£)(£)*■ EXERCISE 17.16 

JL JL Devise a ‘\square’ macro that produces a for use in math formulas. The 
box should be symmetrical with respect to the axis, and its inside dimensions should 
be 3 pt in display and text styles, 2.1 pt in script styles, and 1.5 pt in scriptscript styles. 
The rules should be 0.4 pt thick in display and text styles, 0.3 pt thick otherwise. 

/gb/gb Plain TjjX has a macro called \mathpalette that is useful for \mathchoice 
JL JL constructions; ‘\mathpalette\a{xyz}’ expands to the four-pronged array of 
choices ‘\mathchoice {\a\display style {xyz}} . . . {\a\scriptscriptstyle {xyz}} ’. 
Thus the first argument to \mathpalette is a control sequence whose first argument is 
a style selection. Appendix B contains several examples that show how \mathpalette 
can be applied. (See in particular the definitions of \phantom, \root, and \smash; the 
congruence sign \cong (=) is also constructed from = and ~ using \mathpalette.) 
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feature, 


At the beginning of this chapter we discussed the commands \over, \atop, 
\choose, and \above. These are special cases of TjgX’s “generalized fraction” 
which includes also the three primitives 


\overwithdelims (delinii } (deling) 
\atopwithdelims {delinii ) (deling) 
\abovewithdel ims (delinii } {deling ) {dimen) 


The third of these is the most general, as it encompasses all of the other generalized 
fractions: \overwithdelims uses a fraction bar whose thickness is the default for the 
current size, and \atopwithdelims uses an invisible fraction bar whose thickness is zero, 
while \abovewithdelims uses a bar whose thickness is specified explicitly. TJrX places 
the immediately preceding subformula (the numerator) over the immediately following 
subformula (the denominator), separated by a bar line of the desired thickness; then it 
puts (delinii) at the left and (delim 2 ) at the right. For example, ‘\choose’ is equivalent 
to ‘\atopwithdelims () ’. If you define \legendre to be ‘\overwithdelims () ’, you 
can typeset the Legendre symbol ‘{f)' by saying ‘{a\legendre b}’. The size of the 
surrounding delimiters depends only on the style, not on the size of the fractions; larger 
delimiters are used in styles D and D' (see Appendix G). The simple commands \over, 
\atop, and \above are equivalent to the corresponding ‘withdelims’ commands when 
the delimiters are null; for example, ‘\over’ is an abbreviation for ‘\overwithdelims . . ’. 

EXERCISE 17.17 

JL JL Define a control sequence \euler so that the Eulerian number (fy will be 
produced when you type ‘{n\euler k}’ in a formula. 


/>>/>) Appendix G explains exactly how T£X computes the desired size of delimiters 
JL JL for \left and \right. The general idea is that delimiters are vertically cen- 
tered with respect to the axis; hence, if we want to cover a subformula between \left 
and \right that extends y i units above the axis and y-i units below, we need to make 
a delimiter whose height plus depth is at least y units, where y = 2max(j/i,j/ 2 ). It is 
usually best not to cover the formula completely, however, but just to come close; so 
TEX allows you to specify two parameters, the \delimiterfactor / (an integer) and 
the \delimitershortf all S (a dimension). The minimum delimiter size is taken to be 
at least y ■ f / 1000, and at least y — 5. Appendix B sets / = 901 and 5 = 5pt. Thus, 
if y = 30 pt, the plain TEX format causes the delimiter to be more than 27 pt tall; if 
y = 100 pt, the corresponding delimiter will be at least 95 pt tall. 


So far we have been discussing the rules for typing math formulas, but we 
haven’t said much about how TEX actually goes about converting its input into 
lists of boxes and glue. Almost all of the control sequences that have been mentioned in 
Chapters 16 and 17 are “high level” features of the plain TEX format; they are not built 
into TEX itself. Appendix B defines those control sequences in terms of more primitive 
commands that TEX actually deals with. For example, ‘\choose’ is an abbreviation 
for Aatopwithdelims () ’; Appendix B not only introduces \choose, it also tells TEX 
where to find the delimiters ( and ) in various sizes. The plain TEX format defines all 
of the special characters like \alpha and \mapsto, all of the special accents like \tilde 
and \widehat, all of the large operators like \sum and \int, and all of the delimiters 
like \lfloor and \vert. Any of these things can be redefined, in order to adapt TEX 
to other mathematical styles and/or to other fonts. 
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f Tlie remainder of this chapter discusses the low-level commands that TjjX 
actually obeys behind the scenes. Every paragraph on the next few pages is 
marked with double dangerous bends, so you should skip to Chapter 18 unless you are 
a glutton for TjrXnicalities. 

/gb/gb All characters that are typeset in math mode belong to one of sixteen families 
JL JL of fonts, numbered internally from 0 to 15. Each of these families consists 
of three fonts: one for text size, one for script size, and one for scriptscriptsize. The 
commands \textfont, \scriptfont, and \scriptscriptf ont are used to specify the 
members of each family. For example, family 0 in the plain TjjX format is used for 
roman letters, and Appendix B contains the instructions 

\textfontO=\tenrm 
\scriptfontO=\sevenrm 
\scriptscriptfontO=\f iverm 

to set up this family: The 10-point roman font (\tenrm) is used for normal symbols, 
7-point roman (\sevenrm) is used for subscripts, and 5-point roman (\fiverm) is used 
for sub-subscripts. Since there are up to 256 characters per font, and 3 fonts per family, 
and 16 families, TjgX can access up to 12,288 characters in any one formula (4096 in each 
of the three sizes). Imagine that. 

/gb/gb A definition like \textf ont (family number)=(font identifier) is local to the 
JL JL group that contains it, so you can easily change family membership from one 
set of conventions to another and back again. Furthermore you can put any font into 
any family; for example, the command 

\ scr ipt scr iptf ont 0=\scriptf ontO 

makes sub-subscripts in family 0 the same size as the subscripts currently are. TjrX 
doesn’t check to see if the families are sensibly organized; it just follows instructions. 
(However, fonts cannot be used in families 2 and 3 unless they contain a certain number 
of special parameters, as we shall see later.) Incidentally, TjgX uses \nullfont, which 
contains no characters, for each family member that has not been defined. 

/gb/gb During the time that a math formula is being read, TfrX remembers each 
JL JL symbol as being “character position so-and-so in family number such-and- 
such,” but it does not take note of what fonts are actually in the families until reaching 
the end of the formula. Thus, if you have loaded a font called \Helvetica that contains 
Swiss-style numerals, and if you say something like 

$\textf ontO=\tenrm 9 \textfontO=\Helvetica 9$ 

you will get two 9’s in font \Helvetica, assuming that TjgX has been set up to take 9’s 
from family 0. The reason is that \textfontO is \Helvetica at the end of the formula, 
and that’s when it counts. On the other hand, if you say 

$\textf ontO=\tenrm 9 \hbox{$9\textfontO=\Helvetica$}$ 

the first 9 will be from \tenrm and the second from \Helvetica, because the formula 
in the hbox will be typeset before it is incorporated into the surrounding formula. 

<§)(£)*■ EXERCISE 17.18 

JL JL If you say ‘${\textfontO=\Helvetica 9}$’, what font will be used for the 9? 
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/>)/>) Every math character is given an identifying code number between 0 and 4095, 
JL JL obtained by adding 256 times the family number to the position number. This 
is easily expressed in hexadecimal notation, using one hexadecimal digit for the family 
and two for the character; for example, "24A stands for character "4A in family 2. Each 
character is also assigned to one of eight classes, numbered 0 to 7, as follows: 


Class 

Meaning 

Example 

Class 

Meaning 

Example 

0 

Ordinary 

/ 

4 

Opening 

( 

1 

Large operator 

\sum 

5 

Closing 

) 

2 

Binary operation 

+ 

6 

Punctuation 

> 

3 

Relation 

= 

7 

Variable family 

X 


Classes 0 to 6 tell what “part of speech” the character belongs to, in math-printing 
language; class 7 is a special case discussed below. The class number is multiplied by 
4096 and added to the character number, and this is the same as making it the leading 
digit of a four-digit hexadecimal number. For example, Appendix B defines \sum to 
be the math character "1350, meaning that it is a large operator (class 1) found in 
position "50 of family 3. 


(>)<>)>' EXERCISE 17.19 

JL JL The \oplus and \bullet symbols (© and •) are binary operations that appear 
in positions 8 and 15 (decimal) of family 2, when the fonts of plain TjrX are being used. 
Guess what their math character codes are. (This is too easy.) 

/gb/gb Class 7 is a special case that allows math symbols to change families. It 
JL JL behaves exactly like class 0, except that the specified family is replaced by the 
current value of an integer parameter called \fam, provided that \fam is a legal family 
number (i.e., if it lies between 0 and 15). TRX automatically sets \fam=-l whenever 
math mode is entered; therefore class 7 and class 0 are equivalent unless \fam has been 
given a new value. Plain TpX changes \fam to 0 when the user types Arm’; this makes 
it convenient to get roman letters in formulas, as we will see in Chapter 18, since 
letters belong to class 7. (The control sequence \rm is an abbreviation for ‘\fam=0 
\tenrm’; thus, \rm causes \fam to become zero, and it makes \tenrm the “current 
font.” In horizontal mode, the \fam value is irrelevant and the current font governs the 
typesetting of letters; but in math mode, the current font is irrelevant and the \fam 
value governs the letters. The current font affects math mode only if \ u is used or if 
dimensions are given in ex or em units; it also has an effect if an \hbox appears inside 
a formula, since the contents of an hbox are typeset in horizontal mode.) 


/gb/gb The interpretation of characters in math mode is defined by a table of 256 
JL JL “mathcode” values; these table entries can be changed by the \mathcode com- 
mand, just as the category codes are changed by \catcode (see Chapter 7). Each 
mathcode specifies class, family, and character position, as described above. For exam- 
ple, Appendix B contains the commands 


\mathcode ‘<="313C 
\mathcode ‘ *="2203 
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which cause TgX to treat the character “<’ in math mode as a relation (class 3) found 
in position "3C of family 1, and to treat an asterisk as a binary operation found in 
position 3 of family 2. The initial value of \mathcode‘b is "7162; thus, b is character 
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"62 in family 1 (italics), and its family will vary with \fam. (INITEX starts out with 
\mathcode x = x for all characters x that are neither letters nor digits. The ten digits 
have \mathcode x = x+"7000; the 52 letters have \mathcode x = a~+"7100.) TjAC looks 
at the matlicode only when it is typesetting a character whose catcode is 11 (letter) or 
12 (other), or when it encounters a character that is given explicitly as \char (number). 

/gK/gK A \mathcode can also have the special value "8000, which causes the character 
JL JL to behave as if it has catcode 13 (active). Appendix B uses this feature to 
make ’ expand to "{\prime} in a slightly tricky way. The matlicode of ’ does not 
interfere with the use of ’ in octal constants. 

($>')($>') The matlicode table allows you to refer indirectly to any character in any 
JL JL family, with the touch of a single key. You can also specify a math character 
code directly, by typing \mathchar, which is analogous to \char. For example, the 
command Amathchar'TABC’ specifies a character of class 1, family 10 ("A), and position 
"BC. A hundred or so definitions like 

\def\sum{\mathchar" 1350 } 

would therefore suffice to define the special symbols of plain IjgX- But there is a better 
way: TjjX has a primitive command \mathchardef , which relates to \mathchar just as 
\chardef does to \char. Appendix B has a hundred or so definitions like 

\mathchar def \ sum= "1350 

to define the special symbols. A \mathchar must be between 0 and 32767 ("7FFF). 

/gL/gL A character of class 1, i.e., a large operator like \sum, will be vertically centered 
JL JL with respect to the axis when it is typeset. Thus, the large operators can be 
used with different sizes of type. This vertical adjustment is not made for symbols of 
the other classes. 

/gL/gL TgX associates classes with subformulas as well as with individual characters. 
JL JL Thus, for example, you can treat a complex construction as if it were a bi- 
nary operation or a relation, etc., if you want to. The commands \mathord, \mathop, 
\mathbin, \mathrel, \mathopen, \mathclose, and \mathpunct are used for this pur- 
pose; each of them is followed either by a single character or by a subformula in 
braces. For example, \mathopen\mathchar" 1234 is equivalent to \mathchar"4234, be- 
cause \mathopen forces class 4 (opening). In the formula ‘$G\mathbin: H$’, the colon 
is treated as a binary operation. And Appendix B constructs large opening symbols 
by defining \bigl#l to be an abbreviation for 

\mathopen{\hbox{$\left#l . . Aright. $}} 


There’s also an eighth classification, \mathinner, which is not normally used for in- 
dividual symbols; fractions and \left. . Aright constructions are treated as “inner” 
subformulas, which means that they will be surrounded by additional space in cer- 
tain circumstances. All other subformulas are generally treated as ordinary symbols, 
whether they are formed by \overline or \hbox or \vcenter or by simply being en- 
closed in braces. Thus, \mathord isn’t really a necessary part of the TjrX language; 
instead of typing ‘$l\mathord,234$’ you can get the same effect from ‘$1{,}234$’. 
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<&<!>)*■ EXERCISE 17.20 

JL JL Commands like \mathchardef \alpha="010B are used in Appendix B to define 
the lowercase Greek letters. Suppose that you want to extend plain TgX by putting 
boldface math italic letters in family 9, analogous to the normal math italic letters 
in family 1. (Such fonts aren’t available in stripped down versions of T^rX, but let’s 
assume that they exist.) Assume that the control sequence \bmit has been defined as 
an abbreviation for ‘\f am=9’; hence ‘{\bmit b}’ will give a boldface math italic b. What 
change to the definition of \alpha will make {\bmit\alpha} produce a boldface alpha? 

/>)/>) Delimiters are specified in a similar but more complicated way. Each character 
JL JL lias not only a \catcode and a \mathcode but also a \delcode, which is either 
negative (for characters that should not act as delimiters) or less than "1000000. In 
other words, nonnegative delcodes consist of six hexadecimal digits. The first three 
digits specify a “small” variant of the delimiter, and the last three specify a “large” 
variant. For example, the command 

\delcode ‘ x=" 123456 


means that if the letter x is used as a delimiter, its small variant is found in position 
"23 of family 1, and its large variant is found in position "56 of family 4. If the 
small or large variant is given as 000, however (position 0 of family 0), that variant is 
ignored. T|-'X looks at the delcode when a character follows \left or \right, or when 
a character follows one of the withdelims commands; a negative delcode leads to an 
error message, but otherwise Tj?X finds a suitable delimiter by first trying the small 
variant and then the large. (Appendix G discusses this process in more detail.) For 
example, Appendix B contains the commands 

\delcode ‘ (="028300 \delcode ‘ . =0 


which specify that the small variant of a left parenthesis is found in position "28 of 
family 0, and that the large variant is in position 0 of family 3; also, a period has 
no variants, hence ‘\left . ’ will produce a null delimiter. There actually are several 
different left parenthesis symbols in family 3; the smallest is in position 0, and the others 
are linked together by information that comes with the font. All delcodes are —1 until 
they are changed by a \delcode command. 


EXERCISE 17.21 

JL JL Appendix B defines \delcode ‘ < so that there is a shorthand notation for angle 
brackets. Why do you think Appendix B doesn’t go further and define \delcode‘{? 


/gb/gb A delimiter can also be given directly, as ‘\delimiter(number)’. In this case 
JL JL the number can be as high as "7FFFFFF, i.e., seven hexadecimal digits; the 
leading digit specifies a class, from 0 to 7, as in a \mathchar. For example, Appendix B 
contains the definition 


\def\langle{\delimiter"426830A } 

and this means that Mangle is an opening (class 4) whose small variant is "268 and 
whose large variant is "30A. When \delimiter appears after \left or Wight, the 
class digit is ignored; but when \delimiter occurs in other contexts, i.e., when T|.;X 
isn’t looking for a delimiter, the three rightmost digits are dropped and the remaining 
four digits act as a \mathchar. For example, the expression ‘$\langle x$’ is treated 
as if it were ‘$\mathchar"4268 x$’. 
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► EXERCISE 17.22 

Wliat goes wrong if you type ‘\bigl\delimiter"426830A’ ? 



Granted that these numeric conventions for \mathchar and \delimiter are not 
beautiful, they sure do pack a lot of information into a small space. That’s why 


T)gX uses them for low-level definitions inside formats. Two other low-level primitives 


also deserve to be mentioned: \radical and \mathaccent. Plain T^X makes square 


root signs and math accents available by giving the commands 


\def\sqrt{\radical"270370 } 
\def\widehat{\mathaccent"362 } 


and several more like them. The idea is that \radical is followed by a delimiter 
code and \mathaccent is followed by a math character code, so that T|-'X knows the 
family and character positions for the symbols used in radical and accent constructions. 
Appendix G gives precise information about the positioning of these characters. By 
changing the definitions, l^jX could easily be extended so that it would typeset a variety 
of different radical signs and a variety of different accent signs, if such symbols were 
available in the fonts. 


/>>/>) Plain TjgX uses family 1 for math italic letters, family 2 for ordinary math 
JL JL symbols, and family 3 for large symbols. IgX insists that the fonts in fami- 
lies 2 and 3 have special \fontdimen parameters, which govern mathematical spacing 
according to the rules in Appendix G; the cmsy and cmex symbol fonts have these 
parameters, so their assignment to families 2 and 3 is almost mandatory. (There is, 
however, a way to modify the parameters of any font, using the \fontdimen command.) 
IMITEX initializes the mathcodes of all letters A to Z and a to z so that they are symbols 
of class 7 and family 1; that’s why it is natural to use family 1 for math italics. Sim- 
ilarly, the digits 0 to 9 are class 7 and family 0. None of the other families is treated 
in any special way by l^rX. Thus, for example, plain TpX puts text italic in family 4, 
slanted roman in family 5, bold roman in family 6, and typewriter type in family 7, but 
any of these numbers could be switched around. There is a macro \newfam, analogous 
to \newbox, that will assign symbolic names to families that aren’t already used. 


/>)/>) When TpX is in horizontal mode, it is making a horizontal list: in vertical 
JL JL mode, it is making a vertical list. Therefore it should come as no great surprise 
that TpX is making a math list when it is in math mode. The contents of horizontal 
lists were explained in Chapter 14, and the contents of vertical lists were explained in 
Chapter 15; it’s time now to describe what math lists are made of. Each item in a 
math list is one of the following types of things: 

■ an atom (to be explained momentarily); 

■ horizontal material (a rule or discretionary or penalty or “whatsit”); 

■ vertical material (from \mark or \insert or \vadjust); 

■ a glob of glue (from \hskip or \mskip or \nonscript); 

■ a kern (from \kern or \mkern); 

■ a style change (from \displaystyle, \textstyle, etc.); 

■ a generalized fraction (from \above, \over, etc.); 

■ a boundary (from \left or \right); 

■ a four-way choice (from \mathchoice). 
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The most important items are called atoms, and they have three parts: 
nucleus, a superscript, and a subscript. For example, if you type 


a 


(x_i+y) ~{\overline{n+l}} 


in math mode, you get a math list consisting of five atoms: (, Xi, +, y, and )" +1 . 
The nuclei of these atoms are (, x, -fi, y, and ); the subscripts are empty except for 
the second atom, which has subscript i; the superscripts are empty except for the last 
atom, whose superscript is n + 1. This superscript is itself a math list consisting of one 
atom, whose nucleus is n + 1; and that nucleus is a math list consisting of three atoms. 


There are thirteen kinds of atoms, each of which 
formula; for example, *(’ is an Open atom because 
Here is a complete list of the different kinds: 


might act differently in a 
it comes from an opening. 


Ord 

Op 

Bin 

Rel 

Open 

Close 

Punct 

Inner 

Over 

Under 

Acc 

Rad 

Vcent 


is an ordinary atom like 'x ' : 

is a large operator atom like ; 

is a binary operation atom like “+’ ; 

is a relation atom like “=’ j 

is an opening atom like ‘ ; 

is a dosing atom like ‘ ; 

is a punctuation atom like *$’ ; 

is an inner atom like ‘ y 1 ; 

is an overline atom like l x ' ; 

is an underline atom like ‘xr ; 

is an accented atom like ‘x’ ; 

is a radical atom like 1 y/2 ' ; 

is a vbox to be centered, produced by \vcenter. 



An atom’s nucleus, superscript, and subscript are called its Selds, and there 
are four possibilities for each of these fields; a field can be 


■ empty; 

■ a math symbol (specified by family and position number); 

■ a box; or 

■ a math list. 

For example, the Close atom )" +1 considered above has an empty subscript field; its 
nucleus is the symbol “)’, which is character "28 of family 0 if the conventions of plain 
TpX are in force; and its superscript field is the math list n+ 1. The latter math list 
consists of an Over atom whose nucleus is the math list n + 1; and that math list, in 
turn, consists of three atoms of types Ord, Bin, Ord. 


^>> You can see I^X’s view of a math list by typing \showlists in math mode. 
For example, after ‘$(x_i+y) ~{\overline{n+l}}\showlists’ your log file gets 
the following curious data: 


\mathopen 
. \famO ( 
\mathord 
.\faml x 
_\faml i 
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subscript 
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fields 

showlists 

internal list format 
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\mathbin 
AfamO + 
\mathord 
.\faml y 
\mathclose 
AfamO ) 
"\overline 
* . \mathord 
* . . \f ami n 
* . \mathbin 
" . . \f amO + 
“ . \mathord 
* . . \f amO 1 


In our previous experiences with \showlists we observed that there can be boxes within 
boxes, and that each line in the log file is prefixed by dots to indicate its position in 
the hierarchy. Math lists have a slightly more complex structure; therefore a dot is 
used to denote the nucleus of an atom, a is used for the superscript field, and a 
is used for the subscript field. Empty fields are not shown. Thus, for example, the Ord 
atom Xi is represented here by three lines ‘\mathord’, ‘.\faml x’, and ‘_\faml i’. 


/gK/gK Certain kinds of atoms carry additional information besides their nucleus, 
_H_ JL subscript, and superscript fields: An Op atom will be marked ‘\limits’ or 
‘\nolimits 1 if the normal \displaylimits convention has been overridden; a Rad 
atom contains a delimiter field to specify what radical sign is to be used; and an Acc 
atom contains the family and character codes of the accent symbol. 

/gL/gL When you say \hbox{ . . . } in math mode, an Ord atom is placed on the current 
JL JL math list, with the hbox as its nucleus. Similarly, \vcenter{. . .} produces a 
Vcent atom whose nucleus is a box. But in most cases the nucleus of an atom will be 
either a symbol or a math list. You can experiment with \showlists to discover how 
other things like fractions and mathchoices are represented internally. 

/gL/gL Chapter 26 contains complete details of how math lists are constructed. As 
JL JL soon as math mode ends (i.e., when the closing “$’ occurs), Tj^X dismantles the 
current math list and converts it into a horizontal list. The rules for this conversion are 
spelled out in Appendix G. You can see “before and after” representations of such math 
typesetting by ending a formula with ‘\showlists$\showlists’; the first \showlists 
will display the math list, and the second will show the (possibly complex) horizontal 
list that is manufactured from it. 


The learning time is short. A few minutes gives the general flavor, and 
typing a page or two of a paper generally uncovers most of the misconceptions. 
— KERNIGHAN and CHERRY, A System for Typesetting Mathematics (1975) 

Within a few hours (a few days at most) 
a typist with no math or typesetting experience 
can be taught to input even the most complex equations. 
— PETER J. BOEHM, Software and Hardware Considerations for a 

Technical Typesetting System (1976) 
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Chapter 18: Fine Points of Mathematics Typing 


We have discussed most of the facilities needed to construct math formulas, but 
there are several more things a good mathematical typist will want to watch for. 
After you have typed a dozen or so formulas using the basic ideas of Chapters 
16 and 17, you will find that it’s easy to visualize the final appearance of a 
mathematical expression as you type it. And once you have gotten to that level, 
there’s only a little bit more to learn before you are producing formulas as beau- 
tiful as any the world has ever seen; tastefully applied touches of TgXnique will 
add a professional polish that works wonders for the appearance and readability 
of the books and papers that you type. This chapter talks about such tricks, 
and it also fills in a few gaps by mentioning some aspects of math that didn’t fit 
comfortably into Chapters 16 and 17. 

1. Punctuation. When a formula is followed by a period, comma, semicolon, 
colon, question mark, exclamation point, etc., put the punctuation after the $, 
when the formula is in the text; but put the punctuation before the $$ when the 
formula is displayed. For example, 

If $x<0$, we have shown that $$y=f(x).$$ 

TgX’s spacing rules within paragraphs work best when the punctuation marks 
are not considered to be part of the formulas. 

Similarly, don’t ever type anything like 

for $x = a, b$, or $c$. 

It should be 

for $x = a$ , $b$, or $c$. 

(Better yet, use a tie: ‘or~$c$’.) The reason is that TgX will typeset expression 
‘$x = a, b$’ as a single formula, so it will put a “thin space” between the comma 
and the b. This space will not be the same as the space that TpX puts after 
the comma after the b, since spaces between words are always bigger than thin 
spaces. Such unequal spacing looks bad, but when you type things right the 
spacing will look good. 

Another reason for not typing ‘$x = a, b$’ is that it inhibits the pos- 
sibilities for breaking lines in a paragraph: TgX will never break at the space 
between the comma and the b because breaks after commas in formulas are usu- 
ally wrong. For example, in the equation ‘$x = f(a, b)$’ we certainly don’t 
want to put ‘x = f(a,’ on one line and ‘6)’ on the next. 

Thus, when typing formulas in the text of a paragraph, keep the math 
properly segregated: Don’t take operators like — and = outside of the $’s, and 
keep commas inside the formula if they are truly part of the formula. But if a 
comma or period or other punctuation mark belongs linguistically to the sentence 
rather than to the formula, leave it outside the $’s. 

► EXERCISE 18.1 

Type this: R(n,t ) = 0(W 2 ), as t — > 0 + . 
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f Some mathematical styles insert a bit of extra space around formulas to sep- 
arate them from the text. For example, when copy is being produced on an 
ordinary typewriter that doesn’t have italic letters, the best technical typists have tra- 
ditionally put an extra blank space before and after each formula, because this provides 
a useful visual distinction. You might find it helpful to think of each $ as a symbol that 
has the potential of adding a little space to the printed output; then the rule about 
excluding sentence punctuation from formulas may be easier to remember. 

&)(&) IfeX does, in fact, insert additional space before and after each formula; the 
JL JL amount of such space is called \mathsurround, which is a (dimen)-valued 
parameter. For example, if you set \mathsurround=lpt, each formula will effectively 
be 2 points wider (1 pt at each side): 


For x =a, b, or c. (\mathsurround=lpt) 

For x = a, b, or c. (\mathsurround=Opt) 

This extra space will disappear into the left or right margin if the formula occurs at 
the beginning or end of a line. The value of \mathsurround that is in force when TjjX 
reads the closing $ of a formula is used at both left and right of that formula. Plain 
T)?X takes \mathsurround=Opt, so you won’t see any extra space unless you are using 
some other format, or unless you change \mathsurround yourself. 

2. Non-italic letters in formulas. The names of algebraic variables are usually 
italic or Greek letters, but common mathematical functions like ‘log’ are always 
set in roman type. The best way to deal with such constructions is to make 
use of the following 32 control sequences (all of which are defined in plain T^X 
format, see Appendix B): 


\arccos 

\cos 

\csc 

\exp 

\ker 

\ limsup 

\min 

\sinh 

\arcsin 

\cosh 

\deg 

\gcd 

Ug 

\ln 

\Pr 

\sup 

\ arctan 

\cot 

\det 

\hom 

\lim 

\log 

\sec 

\tan 

\arg 

\coth 

\dim 

\inf 

\liminf 

\max 

\sin 

\tanh 


These control sequences lead to roman type with appropriate spacing: 


Input 

$\sin2\theta=2\sin\theta\cos\theta$ 
$0 (n\log n\log\log n)$ 

$\Pr (X>x)=\exp(-x/\mu)$ 

$$\max_{l\le n\le m}\log_2P_n$$ 

$$\lim_{x\toO}{\sin x\over x}=l$$ 


Output 

sin 26 = 2 sin 6 cos 6 
0( /rlog nlog logn) 

Pr(A" > x) = exp(— x/p) 
max log 2 P n 

l<n<m 


. . smx 
lim 

®-»-o x 


= i 


The last two formulas, which are displays, show that some of the special control 
sequences are treated by Tp^X as “large operators” with limits just like JT The 
subscript on \max is not treated like the subscript on \log. Subscripts and 
superscripts will become limits when they are attached to \det, \gcd, \inf, 
\lim, \liminf, \limsup, \max, \min, \Pr, and \sup, in display style. 
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► EXERCISE 18.2 

Express the following display in plain TgX language, using ‘\nu’ for V: 


Pi(n) = lim Vfl-cos^HV"))- 


f lf you need roman type for some mathematical function or operator that isn’t 
included in plain TgX’s list of 32, it is easy to define a new control sequence by 
mimicking the definitions in Appendix B. Or, if you need roman type just for a “one 
shot" use, it is even easier to get what you want by switching to \rm type, as follows: 


$\sqrt{{\rm Var}(X)}$ 

$x_{\rm max}- x_{\rm min}$ 

${\rm LL} (k) \Rightarrow{\rm LR}(k)$ 
$\exp(x+{\rm constant}) $ 

$x“3+{\rm lower\ order\ terms}! 


yVar(A') 

^max min 

LL(fc) LR(Jfc) 
exp(r + constant) 

+ lower order terms 


Notice the uses of ‘\u’ in the last case: without them, the result would have been 
‘x 3 + lowerorderterms’, because ordinary blank spaces are ignored in math mode. 

f You can also use \hbox instead of \rm to get roman letters into formulas. For 
example, four of the last five formulas can be generated by 


$\sqrt{\hbox{Var} (X) }$ 

$\hbox{LL} (k) \Rightarrow\hbox{LR} (k) $ 
$\exp (x+\hbox{constant})$ 
$x“3+\hbox{lower order terms}! 


y/Var(A-) 

LL (jfc) => LR (k) 
exp(a: -F constant) 

* 3 + lower order terms 


In this case ‘Xu’ isn’t necessary, because the material in an \hbox is processed in horizon- 
tal mode, when spaces are significant. But such uses of \hbox have two disadvantages: 
(1) The contents of the box will be typeset in the same size, whether or not the box 
occurs as a subscript: for example, ‘!x_{\hbox{max}}!’ yields ‘tfniax’. (2) The font 
that’s used inside \hbox will be the “current font," so it might not be roman. For ex- 
ample, if you are typesetting the statement of some theorem that is in slanted type, and 
if that theorem refers to l !\sqrt{\hbox{Var} (X) }!’, you will get the unintended result 
1 sj Vax{X)\ In order to make sure that an \hbox uses roman type, you need to specify 
\rm, e.g., ‘!\sqrt{\hbox{\rm Var}(X)}!’; and then the \hbox serves no purpose. We 
will see later, however, that \hbox can be very useful in displayed formulas. 

&)&)*■ EXERCISE 18.3 

JL jl When the displayed formula ‘!!\lim_{n\to\infty}x_n {\rm\ exists} \iff 
\limsup_{n\to\infty}x_n = \liminf_{n\to\infty}x_n. !!’ is typeset with the stan- 
dard macros of plain TgX, you get 


lim x„ exists 


lim sup x n = lim inf x n . 


But some people prefer a different notation: Explain how you could change the defini- 
tions of \limsup and \liminf so that the display would be 


lim Xn exists 


lim x n = lim x n . 


nu 

rm 

control space 
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f Tlie word ‘mod’ is also generally set in roman type, when it occurs in formulas; 

but this word needs more care, because it is used in two different ways that 
require two different treatments. Plain T^X provides two different control sequences, 
\bmod and \pmod, for the two cases: \bmod is to be used when ‘mod’ is a binary operation 
(i.e., when it occurs between two quantities, like a plus sign usually does), and \pmod 
is to be used when ‘mod’ occurs parenthetically at the end of a formula. For example, 

$\gcd(m,n)=\gcd(n,m\bmod n)$ gcd(m, n) = gcd(n, m mod n) 

$x\equiv y+l\pmod{m"2}$ x = y + 1 (mod m 2 ) 


The ‘b’ in ‘\bmod’ stands for “binary”; the ‘p’ in ‘\pmod’ stands for “parenthesized.” 
Notice that \pmod inserts its own parentheses; the quantity that appears after ‘mod’ in 
the parentheses should be enclosed in braces, if it isn’t a single symbol. 



► EXERCISE 18.4 


What did poor B. L. User get when he typed ‘$x\equivO 

► EXERCISE 18.5 / n /, / ,\ / , \ 

Explain how to produce [ " ) = ( ^ ) ( '/ m0< M 
yk J \lk/p\J \k mod pj 


(\pmod y~n)$’? 
(mod p). 



The same mechanism that works for roman type in formulas can be used to 
get other styles of type as well. For example, \bf yields boldface: 

$\bf a+b=\Phi_m$ a + b = 4> m 


Notice that whole formula didn’t become emboldened in this example; the “+’ and 
“=’ stayed the same. Plain TJjX sets things up so that commands like \rm and \bf 
will affect only the uppercase letters A to Z, the lowercase letters a to z, the digits 
0 to 9, the uppercase Greek letters \Gamma to \0mega, and math accents like \hat and 
\tilde. Incidentally, no braces were used in this example, because $’s have the effect 
of grouping; \bf changes the current font, but the change is local, so it does not affect 
the font that was current outside the formula. 


The bold fonts available in plain Tp:X are “bold roman,” rather than “bold 
JL JL italic,” because the latter are rarely needed. However, T)?X could readily be 
set up to make use of bold math italics, if desired (see Exercise 17.20). A more extensive 
set of math fonts would also include script, Fraktur, and “blackboard bold” styles; plain 
TJrX doesn’t have these, but other formats like xIatS-T^X do. 

f Besides \rm and \bf, you can say \cal in formulas to get uppercase letters in 
a “calligraphic” style. For example, ‘$\cal A$’ produces ‘A 1 and ‘$\cal Z$’ 
produces ‘Z\ But beware: This works only with the letters A to Z; you’ll get weird 
results if you apply \cal to lowercase or Greek letters. 

f There’s also \mit, which stands for “math italic.” This affects uppercase 
Greek, so that you get (T, A, Q, A, S, IT, E, T, <P, T, ft) instead of (T, ...,0). 
When \mit is in effect, the ordinary letters A to Z and a to z are not changed; they 
are set in italics as usual, because they ordinarily come from the math italic font. 
Conversely, uppercase Greek letters and math accents are unaffected by \rm, because 
they ordinarily come from the roman font. Math accents should not be used when the 
\mit family has been selected, because the math italic font contains no accents. 
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► EXERCISE 18.6 

Type the formula x T Mx = 0 x = 0, using as few keystrokes as possible, 

first ‘0’ is roman, the second is bold. The superscript ‘T’ is roman.) 



► EXERCISE 18.7 

Figure out how to typeset ‘S' C S 


S € S’. 


f Plain T|.’X also allows you to type \it, \sl, or \tt, if you want text italic, 
slanted, or typewriter letters to occur in a math formula. However, these fonts 
are available only in text size, so you should not try to use them in subscripts. 


f lf you’re paying attention, you probably wonder why both \mit and \it are 
provided; the answer is that \mit is “math italic” (which is normally best for 
formulas), and Nit is “text italic” (which is normally best for running text). 


$This\ is\ math\ italic.! This is math italic. 

{\it This is text italic.} This is text italic. 


The math italic letters are a little wider, and the spacing is different; this works better 
in most formulas, but it fails spectacularly when you try to type certain italic words 
like ‘ different ’ using math mode (‘$different$’). A wide ‘/’ is usually desirable 
in formulas, but it is undesirable in text. Therefore wise typists use Nit in a math 
formula that is supposed to contain an actual italic word. Such cases almost never 
occur in classical mathematics, but they are common when computer programs are 
being typeset, since programmers often use multi-letter “identifiers”: 

$Nit last:=first$ last := first 

$Nit xN_coord(pointN_2)$ x -coord (point -2) 


The first of these examples shows that T)gX recognizes the ligature ‘fi' when text italic 
occurs in a math formula; the other example illustrates the use of short underlines to 
break up identifier names. When the author typeset this manual, he used ‘$Nit SS$’ 
to refer to style SS, since ‘$SS$’ makes the S's too far apart: SS. 



► EXERCISE 18.8 

What plain TJ^X commands will produce the following display? 


n 

available + max [full (i), reserved(i )) = capacity. 

i= 1 


<>)(>)*■ EXERCISE 18.9 

JL JL How would you go about typesetting the following computer program, using 
the macros of plain l}rX? 


for j := 2 step 1 until n do 

begin accum := A[j ]; k := j — 1; A[0] := accum; 
while A[k\ > accum do 

begin A[k + 1] := A[k\; k := k — 1; 

end; 

A[k + 1] := accum; 

end. 
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3. Spacing between formulas. Displays often contain more than one formula; for 
example, an equation is frequently accompanied by a side condition: 

F n = F n _ i + F n _ 2 , n > 2. 

In such cases you need to tell how much space to put after the comma, 
because Tf^X’s normal spacing conventions would bunch things together; without 
special precautions you would get 

F n = F n - 1 + F„_o, n > 2. 


The traditional hot-metal technology for printing has led to some in- 
grained standards for situations like this, based on what printers call a “quad” 
of space. Since these standards seem to work well in practice, T^X makes it easy 
for you to continue the tradition: When you type ‘\quad’ in plain TgX format, 
you get a printer’s quad of space in the horizontal direction. Similarly, ‘\qquad’ 
gives you a double quad (twice as much); this is the normal spacing for situations 
like the F n example above. Thus, the recommended procedure is to type 

$$ F_n = F_{n-1} + F_{n-2}, \qquad n \ge 2. $$ 

It is perhaps worth reiterating that TgX ignores all the spaces in math mode 
(except, of course, the space after ‘\qquad’, which is needed to distinguish be- 
tween ‘\qquad n’ and ‘\qquadn’); so the same result would be obtained if you 
were to leave out all but one space: 


$$F_n=F_{n-l}+F_{n-2} , \qquad n\ge2 . $$ 

Whenever you want spacing that differs from the normal conventions, you must 
specify it explicitly by using control sequences such as \quad and \qquad. 

f A quad used to be a square piece of blank type, 1 em wide and 1 em tall — 
approximately the size of a capital M, as explained in Chapter 10. This 
tradition has not been fully retained: The control sequence \quad in plain TgX is simply 
an abbreviation for ‘\hskip lem\relax’, so TjjX’s quad has width but no height. 

f You can use \quad in text as well as in formulas; for example, Chapter 14 
illustrates how \quad applies to poetry. When \quad appears in a formula it 
stands for one em in the current text font, independent of the current math size or 
style or family. Thus, for example, \quad is just as wide in a subscript as it is on the 
main line of a formula. 


Sometimes a careless author will put two formulas next to each other in 
the text of a paragraph. For example, you might find a sentence like this: 

The Fibonacci numbers satisfy F n = F n _ i + F n _ 2 , n > 2. 

Everybody who teaches proper mathematical style is agreed that formulas ought 
to be separated by words, not just by commas; the author of that sentence 
should at least have said ‘for n > 2’, not simply l n > 2’. But alas, such lapses 
are commonplace, and many prominent mathematicians are hopelessly addicted 
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to clusters of formulas. If we are not allowed to change their writing style, we can 
at least insert extra space where they neglected to insert an appropriate word. 
An additional interword space generally works well in such cases; for example, 
the sentence above was typeset thus: 

... $F_n=F_{n-l}+F_{n-2}$ , \ $n\ge2$.}$$ 

The ‘V here gives a visual separation that partly compensates for the bad style. 

► EXERCISE 18.10 

Put the following paragraph into TpX form, treating punctuation and spacing 
carefully; also insert ties to prevent bad line breaks. 

Let H be a Hilbert space, C a closed bounded convex subset of H, 

T a nonexpansive self map of C . Suppose that as n — > oo, a n ^ — > 0 
for each k, and j n = ^^ 0 ( a n,Hi _a n,*:) + — l > 0. Then for each x in C, 
A n x = YlT=o a n,kT k x converges weakly to a fixed point of T. 

4 • Spacing within formulas. Chapter 16 says that TpjX does automatic spacing of 
math formulas so that they look right, and this is almost true. But occasionally 
you must give Tj^X some help. The number of possible math formulas is vast, 
and TpX’s spacing rules are rather simple, so it is natural that exceptions should 
arise. Of course, it is desirable to have fine units of spacing for this purpose, 
instead of the big chunks that arise from \ u , \quad and \qquad. 

The basic elements of space that TpX puts into formulas are called thin 
spaces , medium spaces, and thick spaces. In order to get a feeling for these units, 
let’s take a look at the F n example again: Thick spaces occur just before and 
after the = sign, and also before and after the > ; medium spaces occur just 
before and after the + sign. Thin spaces are slightly smaller, but noticeable; it’s 
a thin space that makes the difference between ‘loglog’ and ‘log log’. The normal 
space between words of a paragraph is approximately equal to two thin spaces. 

Tf^X inserts thin spaces, medium spaces, and thick spaces into formulas 
automatically, but you can add your own spacing whenever you want to, by using 
the control sequences 

\, thin space (normally 1/6 of a quad); 

\> medium space (normally 2/9 of a quad); 

\; thick space (normally 5/18 of a quad); 

\! negative thin space (normally —1/6 of a quad). 

In most cases you can rely on TgX’s spacing while you are typing a manuscript, 
and you’ll want to insert or delete space with these four control sequences only 
in rare circumstances after you see what comes out. 

/gb/gb We observed a minute ago that \quad spacing does not change with the style 
JL JL of formula, nor does it depend on the math font families that are being used. 
But thin spaces, medium spaces, and thick spaces do get bigger and smaller as the size 
of type gets bigger and smaller; this is because they are defined in terms of (muglue), 
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a special brand of glue intended for math spacing. You specify (muglufi) just as if it 
were ordinary glue, except that the units are given in terms of ‘mu’ (math units) instead 
of pt or cm or something else. For example, Appendix B contains the definitions 

\thinmuskip = 3mu 

\medmuskip = 4mu plus 2mu minus 4mu 
\thickmuskip = 5mu plus 5mu 

and this defines the thin, medium, and thick spaces that TpX inserts into formulas. 
According to these specifications, thin spaces in plain TpX do not stretch or shrink; 
medium spaces can stretch a little, and they can shrink to zero; thick spaces can stretch 
a lot, but they never shrink. 

/^X/gX There are 18 mu to an em, where the em is taken from family 2 (the math 
JL JL symbols family). In other words, \textfont 2 defines the em value for mu 
in display and text styles; \scriptfont 2 defines the em for script size material; and 
\scriptscriptf ont 2 defines it for scriptscript size. 

/>)/>) You can insert math glue into any formula just by giving the command 
JL JL ‘\mskip{muglue)’. For example, ‘\mskip 9mu plus 2mu’ inserts one half em of 
space, in the current size, together with some stretcliability. Appendix B defines ‘\,’ 
to be an abbreviation for ‘\mskip\thinmuskip’. Similarly, you can use the command 
‘\mkern’ when there is no stretching or shrinking; ‘\mkernl8mu’ gives one em of hori- 
zontal space in the current size. TjgX insists that \mskip and \mkern be used only with 
mu; conversely, \hskip and \kern (which are also allowed in formulas) must never give 
units in mu. 


Formulas involving calculus look best when an extra thin space appears 
before dx or dy or cl whatever; but TpX doesn’t do this automatically. Therefore 
a well-trained typist will remember to insert ‘\,’ in examples like the following: 

Input Output 

$\int_0~\infty f(x)\,dx$ f£° f(x)dx 

$y\ , dx-x\ , dy$ y dx — x dy 

$dx\ , dy=r\ , dr\ , d\theta$ dx dy = r dr dO 

$x\ , dy/ dx$ x dy /dx 

Notice that no ‘\,’ was desirable after the 7’ in the last example. Similarly, 
there’s no need for ‘\ , ’ in cases like 

$$\int_l~x{dt\over t}$$ 

since the dt appears all by itself in the numerator of a fraction; this detaches it 
visually from the rest of the formula. 

► EXERCISE 18.11 

Explain how to handle the display 

j~ rt J-\ 

— — e mt dt = e ab Ei(ab), a , b > 0. 

t- + 6“ 
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When physical units appear in a formula, they should be set in roman type 
and separated from the preceding material by a thin space: 

$55\rm\ ,mi/hr$ 55 mi/hr 

$g=9.8\rm\,m/sec~2$ g = 9.8m/sec 2 

$\rml\ ,ml=l . 000028X , cc$ 1 ml = 1.000028 cc 


►EXERCISE 18.12 

JL Typeset the following display, assuming that ‘\hbar’ generates l h': 

h = 1.0545 x 10 -27 erg sec. 

f Thin spaces should also be inserted after exclamation points (which stand for 
the “factorial” operation in a formula), if the next character is a letter or a 
number or an opening delimiter: 


$(2n) !/\bigl(n!\, (n+1) !\bigr)$ 
$${52 ! \over 13 ! \ , 13 ! \ , 26 ! }$$ 


(2 n)\/(n\ (n + 1)!) 
52! 

13! 13! 26! 


Besides these cases, you will occasionally encounter formulas in which 
the symbols are bunched up too tightly, or where too much white space appears, 
because of certain unlucky combinations of shapes. It’s usually impossible to 
anticipate optical glitches like this until you see the first proofs of what you 
have typed; then you get to use your judgment about how to add finishing 
touches that provide extra beauty, clarity, and finesse. A tastefully applied ‘\,’ 
or ‘\ ! ’ will open things up or close things together so that the reader won’t be 
distracted from the mathematical significance of the formula. Square root signs 
and multiple integrals are often candidates for such fine tuning. Here are some 
examples of situations to look out for: 


$\sqrt2\ ,x$ 

s/2x 

$\sqrt{\ , \log x}$ 

x/logx 

$0\bigl(l/\sqrt n\,\bigr)$ 

0{1/Vn) 

$[\,0,1)$ 

[0,1) 

$\log n\ , (\log\log n) "2$ 

log n (log log n) 

$x~2\ ! /2$ 

ar/2 

$n/\ ! \log n$ 

n /log n 

$\Gamma_{\ ! 2}+\Delta~{\ ! 2}$ 

r 2 + A 2 

$R_iO~jO_A!kl}$ 

Ri j kl. 

$\int_0~x\ ! \int_0~y dF(u,v)$ 

Iofo y dF(u,v) 

$$\int\ ! \ ! \ ! \int_D dx\,dy$$ 

j 1 dx dy 


In each of these formulas the omission of \ , or \ ! would lead to somewhat less 
satisfactory results. 
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/gK/gK Most of these examples where thin-space corrections are desirable arise because 
JL JL of chance coincidences. For example, the superscript in $x"2/2$ leaves a hole 
before the slash (a: 2 /2); a negative thin space helps to fill that hole. The positive 
thin space in $\sqrt{\ ,\l°g x}$ compensates for the fact that ‘log at’ begins with a 
tall, unslanted letter; and so on. But two of the examples involve corrections that were 
necessary because TgX doesn’t really know a great deal about mathematics: (1) In the 
formula $\log n(\log\log n) "2$, TjgX inserts no thin space before the left parenthesis, 
because there are similar formulas like $\log n(x)$ where no such space is desired. 
(2) In the formula $n/\log n$, TjrX automatically inserts an unwanted thin space 
before \log, since the slash is treated as an ordinary symbol, and since a thin space is 
usually desirable between an ordinary symbol and an operator like \log. 


In fact, TjgX’s rules for spacing in formulas are fairly simple. A formula is con- 
verted to a math list as described at the end of Chapter 17, and the math list 
consists chiefly of “atoms” of eight basic types: Ord (ordinary), Op (large operator), 
Bin (binary operation), Rel (relation), Open (opening), Close (closing), Punct (punc- 
tuation), and Inner (a delimited subformula). Other kinds of atoms, which arise from 
commands like \overline or \mathaccent or \vcenter, etc., are all treated as type Ord; 
fractions are treated as type Inner. The following table is used to determine the spacing 
between pairs of adjacent atoms: 

Right atom 




Ord 

Op 

Bin 

Rel 

Open 

Close 

Punct 

Inner 


Ord 

0 

1 

(2) 

(3) 

0 

0 

0 

( 1 ) 


Op 

1 

1 

* 

(3) 

0 

0 

0 

( 1 ) 


Bin 

(2) 

(2) 

* 

* 

(2) 

* 

* 

(2) 

Left 

Rel 

(3) 

(3) 

* 

0 

(3) 

0 

0 

(3) 

atom 

Open 

0 

0 

* 

0 

0 

0 

0 

0 


Close 

0 

1 

(2) 

(3) 

0 

0 

0 

( 1 ) 


Punct 

( 1 ) 

( 1 ) 

* 

( 1 ) 

( 1 ) 

( 1 ) 

( 1 ) 

( 1 ) 


Inner 

( 1 ) 

1 

(2) 

(3) 

( 1 ) 

0 

( 1 ) 

( 1 ) 


Here 0, 1, 2, and 3 stand for no space, thin space, medium space, and thick space, 
respectively; the table entry is parenthesized if the space is to be inserted only in 
display and text styles, not in script and scriptscript styles. For example, many of the 
entries in the Rel row and the Rel column are ‘(3)’; this means that thick spaces are 
normally inserted before and after relational symbols like “=’, but not in subscripts. 
Some of the entries in the table are ‘*’| such cases never arise, because Bin atoms must 
be preceded and followed by atoms compatible with the nature of binary operations. 
Appendix G contains precise details about how math lists are converted to horizontal 
lists; this conversion is done whenever TjjX is about to leave math mode, and the 
inter atomic spacing is inserted at that time. 



For example, the displayed formula specification 
$$x+y=\max\{x,y\}+\min\{x,y\}$$ 


atoms 

Ord 

ordinary 

Op 

large operator 
Bin 

binary operation 
Rel 

relation 

Open 

opening 

Close 

closing 

Punct 

punctuation 

Inner 

overline 

mathaccent 

vcenter 

fractions 

spacing table 

math spacing table 


will be transformed into the sequence of atoms 

n ffl 0 b ms a 0 Eb0 0H E5I H| m „ u 0 
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of respective types Ord, Bin, Ord, Rel, Op, Open, Ord, Punct, Ord, Close, Bin, Op, 
Open, Ord, Punct, Ord, and Close. Inserting spaces according to the table gives 

Ord \> Bin \> Ord \ ; Rel \ ; Op Open Ord Punct \ , Ord Close \> 

Bin \> Op Open Ord Punct \ , Ord Close 


and the resulting formula is 


ra El i b ni230i|j eC| El ESjJqj e 


x + y = max{i, y} + min{a;, y} 

This example doesn’t involve subscripts or superscripts; but subscripts and superscripts 
merely get attached to atoms without changing the atomic type. 

EXERCISE 18.13 

JL JL Use the table to determine what spacing TgX will insert between the atoms 
of the formula ‘$f (x,y) <x"2+y~2$’. 

/gb/gb The plain IgX macros \bigl, \bigr, \bigm, and \big all produce identical 
JL JL delimiters; the only difference between them is that they may lead to different 
spacing, because they make the delimiter into different types of atoms: \bigl produces 
an Open atom, \bigr a Close, \bigm a Rel, and \big an Ord. On the other hand, when 
a subformula appears between \left and \right, it is typeset by itself and placed into 
an Inner atom. Therefore it is possible that a subformula enclosed by \left and \right 
will be surrounded by more space than there would be if that subformula were enclosed 
by \bigl and \bigr. For example, Ord followed by Inner (from \left) gets a thin 
space, but Ord followed by Open (from \bigl) does not. The rules in Chapter 17 
imply that the construction ‘\mathinner{\bigl({(subformula)}\bigr)}’ within any 
formula produces a result exactly equivalent to ‘\left((subformula)\right)’, when 
the (subformula) doesn’t end with Punct, except that the delimiters are forced to be 
of the \big size regardless of the height and depth of the subformula. 

f T^jX’s spacing rules sometimes fail when ‘I’ and ‘\ I ’ appear in a formula, 
because | and || are treated as ordinary symbols instead of as delimiters. For 
example, consider the formulas 


$ I —x I = I +x I $ 

$\lef 1 1 -x\right I =\lef 1 1 +x\right I $ 
$\lfloor-x\rf loor=-\lceil+x\rceil$ 


I -x\ = | +x\ 
1 - 2-1 = 1 + 2-1 
l-x] = -[+2:1 


In the first case the spacing is wrong because TgX thinks that the plus sign is computing 
the sum of ‘|’ and ‘x\ The use of \left and \right in the second example puts T[.;X 
on the right track. The third example shows that no such corrections are needed with 
other delimiters, because TgX knows whether they are openings or closings. 

&>&>>■ EXERCISE 18.14 

JL JL Some perverse mathematicians use brackets backwards, to denote “open in- 
tervals.” Explain how to type the following bizarre formula: ]— oo,T[ x ]— oo,T[. 

(£)(£)*■ EXERCISE 18.15 

JL JL Study Appendix G and determine what spacing will be used in the formula 
‘$x++l$’. Which of the plus signs will be regarded as a binary operation? 


bigl 

bigr 

bigm 

big 

left 

right 

mathinner 

delimiters 

verticalline 

verticalline 

brackets 

open intervals 

binary operation 
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5. Ellipses (“three dots”). Mathematical copy looks much nicer if you are careful 
about how groups of three dots are typed in formulas and text. Although it looks 
fine to type ‘ . . . ’ on a typewriter that has fixed spacing, the result looks too 
crowded when you’re using a printer’s fonts: ‘$x. . .y$’ results in l x..:y\ and 
such close spacing is undesirable except in subscripts or superscripts. 

An ellipsis can be indicated by two different kinds of dots, one higher 
than the other; the best mathematical traditions distinguish between these two 
possibilities. It is generally correct to produce formulas like 

X\ + • • • + x n and (;ri , . . . , x n ), 

but wrong to produce formulas like 

Xi + ... + ./•„ and (x % , • • • , ./•„ I. 

The plain Tf^X format of Appendix B allows you to solve the “three dots” problem 
very simply, and everyone will be envious of the beautiful formulas that you 
produce. The idea is simply to type \ldots when you want three low dots (...), 
and \cdots when you want three vertically centered dots (•••)• 

In general, it is best to use \cdots between + and — and x signs, and 
also between = signs or < signs or C signs or other similar relations. Low dots 
are used between commas, and when things are juxtaposed with no signs between 
them at all. For example: 


$x_l+\cdots+x_n$ 
$x_l=\cdots=x_n=0$ 
$A_l\times\cdots\times A_n$ 
$f (x_l ,\ldots ,x_n)$ 
$x_lx_2\ldots x_n$ 

$(l-x) (l-x~2)\ldots(l-x~n)$ 
$n(n-l) \ldots (1) $ 


xi d h x„ 

x l = ■ ■■ = x n = 0 
Ai x • • • x A n 
f(x x n ) 

XiX-2 ...x n 

(1 — x)(l — x 2 ) ... (1 — x n ) 
n(n — 1) ... (1) 


► EXERCISE 18.16 

Type the formulas L x\ + X1X2 + ■ ■ ■ + xix-2 ■ ■ ■ x n ’ and ‘(xi ,x n )-(y i, . • • , y n ) = 
xiyi + • • • + x n y n \ [Hint: A single raised dot is called ‘\cdot’.] 


But there’s an important special case in which \ldots and \cdots don’t 
give the correct spacing, namely when they appear at the very end of a formula, 
or when they appear just before a closing delimiter like “)’. In such situations 
an extra thin space is needed. For example, consider sentences like this: 


Prove that (1 — x) _1 = 1 + x + x 2 + ■ ■ ■ . 
Clearly a,; < foj for i = 1, 2, . . . , n. 

The coefficients Co, ci, . . . , c n are positive. 


ellipses 

three dots 

ldots 

cdots 

commas 

cdot 

thin space 


To get the first sentence, the author typed 

Prove that $(l-x)~{-l}=l+x+x~2+\cdots\,$. 
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Without the ‘ \ ’ the period would have come too close to the \cdots. Similarly, 
the second sentence was typed thus: 

Clearly $a_i<b_i$ for $i=l$,~2, $\ldots\ , $ , ~$n$ . 

Notice the use of ties, which prevent bad line breaks as explained in Chapter 14. 
Such ellipses are extremely common in some forms of mathematical writing, so 
plain allows you to say just ‘\dots’ as an abbreviation for ‘$\ldots\,$’ in 
the text of a paragraph. The third sentence can therefore be typed 

The coefficients $c_0$ , ~$c_l$ , \dots,~$c_n$ are positive. 

► EXERCISE 18.17 

B. C. Dull tried to take a shortcut by typing the second example this way: 

Clearly $a_i<b_i$ for~$i=l, 2, \ldots, n$ . 

What’s so bad about that? 

► EXERCISE 18.18 

How do you think the author typed the footnote in Chapter 4 of this book? 

6. Line breaking. When you have formulas in a paragraph, T|5]X may have to 
break them between lines. This is a necessary evil, something like the hyphen- 
ation of words; we want to avoid it unless the alternative is worse. 

A formula will be broken only after a relation symbol like = or < or — », 
or after a binary operation symbol like + or — or x , where the relation or binary 
operation is on the “outer level” of the formula (i.e., not enclosed in {. . .} and 
not part of an ‘\over’ construction). For example, if you type 

$f(x,y) = x~2-y~2 = (x+y)(x-y)$ 

in mid-paragraph, there’s a chance that Tj;X will break after either of the = signs 
(it prefers this) or after the - or + or - (in an emergency). But there won’t be 
a break after the comma in any case — commas after which breaks are desirable 
shouldn’t appear between $’s. 

If you don’t want to permit breaking in this example except after the 
= signs, you could type 

$f(x,y) = {x~2-y~2} = { (x+y) (x-y) >$ 

because these additional braces “freeze” the subformulas, putting them into un- 
breakable boxes in which the glue has been set to its natural width. But it isn’t 
necessary to bother worrying about such things unless TgX actually does break 
a formula badly, since the chances of this are pretty slim. 

f A “discretionary multiplication sign” is allowed in formulas: If you type 
‘$(x+y)\*(x-y)$’, TjgX will treat the \* something like the way it treats \-; 
namely, a line break will be allowed at that place, with the hyphenation penalty. How- 
ever, instead of inserting a hyphen, Tjr(K will insert a x sign in text size. 


ties 

dots 

Dull 

footnote 

line breaking in math 
breaking formulas between lines 
subformulas 

discretionary multiplication sign 
* 
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If you do want to permit a break at some point in the outer level of a formula, 
you can say \allowbreak. For example, if the formula 


$ (x_l , \ldots ,x_m, \allowbreak y_l , \ldots ,y_n)$ 


appears in the text of a paragraph, TgX will allow it to be broken into the two pieces 
‘(x\ i • • • i %m ? and ‘i/i, . . . ,y n )'- 

The penalty for breaking after a Rel atom is called \relpenalty, and the 
JL JL penalty for breaking after a Bin atom is called \binoppenalty. Plain TjrjX sets 
\relpenalty=500 and \binoppenalty=700. You can change the penalty for breaking 
in any particular case by typing ‘\penalty {number)’ immediately after the atom in 
question; then the number you have specified will be used instead of the ordinary 
penalty. For example, you can prohibit breaking in the formula ‘x = O’ by typing 
‘$x=\nobreakO$’, since \nobreak is an abbreviation for ‘\penaltyl0000 



► EXERCISE 18.19 

Is there any difference between the results of ‘$x=\nobreakO$’ and ‘${x=0}$’? 


(>)<>)*■ EXERCISE 18.20 

JL JL How could you prohibit all breaks in formulas, by making only a few changes 
to the macros of plain Tj?X? 


7. Braces. A variety of different notations have sprung up involving the symbols 
and plain TgX includes several control sequences that help you cope with 
formulas involving such things. 

In simple situations, braces are used to indicate a set of objects; for 
example, ‘{a, b, c}’ stands for the set of three objects a, b , and c. There’s nothing 
special about typesetting such formulas, except that you must remember to use 
\{ and \} for the braces: 


$\{a,b , c\}$ {a.,b,c} 

$\{1 ,2 , \ldots ,n\}$ {1,2,..., n} 

$\{\rm red, white, blue\}$ {red, white, blue} 

A slightly more complex case arises when a set is indicated by giving a generic 
element followed by a specific condition; for example, ‘{.t | x > 5}’ stands for 
the set of all objects x that are greater than 5. In such situations the control 
sequence \mid should be used for the vertical bar, and thin spaces should be 
inserted inside the braces: 


allowbreak 

relpenalty 

binoppenalty 

penalty 

nobreak 

braces 

leftbrace 

rightbrace 

set 

leftbrace 

rightbrace 

mid 

vertical bar 
colon 


$\{\,x\mid x>5\,\}$ { x \ x > 5 } 

$\{\,x:x>5\,\}$ { ;E : ;E > 5 } 

(Some authors prefer to use a colon instead of as in the second example here.) 
When the delimiters get larger, as in 


{ | x G D } 
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they should be called \bigl, \bigm, and \bigr; for example, the formula just 
given would be typed 

\bigl\{\ , \bigl (x , f (x) \bigr) \bigm | x\in D\ , \bigr\} 

and formulas that involve still larger delimiters would use \Big or \bigg or even 
\Bigg, as explained in Chapter 17. 

► EXERCISE 18.21 

How would you typeset the formula { x 3 | h(x) £ { — 1,0, +1} }? 

►EXERCISE 18.22 

JL Sometimes the condition that defines a set is given as a fairly long English 
description, not as a formula; for example, consider p \ p and p +2 are prime }’. An 
hbox would do the job: 

$\{\,p\nii<I\hbox{$p$ and $p+2$ are prime}\,\}$ 

but a long formula like this is troublesome in a paragraph, since an hbox cannot be 
broken between lines, and since the glue inside the \hbox does not vary with the 
interword glue in the line that contains it. Explain how the given formula could be 
typeset with line breaks allowed. [Hint: Go back and forth between math mode and 
horizontal mode.] 

Displayed formulas often involve another sort of brace, to indicate a 
choice between various alternatives, as in the construction 

, I = f x, if x > 0; 

'' 1 { —x, otherwise. 

You can typeset it with the control sequence \cases: 

$$ |x|=\cases{x,&if $x\geO$;\cr 

-x , feotherwise . \cr}$$ 

Look closely at this example and notice that it uses the character &, which we 
said in Chapter 7 was reserved for special purposes. Here for the first time in 
this manual we have an example of why & is so special: Each of the cases has 
two parts, and the & separates those parts. To the left of the & is a math formula 
that is implicitly enclosed in $...$; to the right of the & is ordinary text, which 
is not implicitly enclosed in $...$. For example, the ‘-x,’ in the second line 
will be typeset in math mode, but the ‘otherwise’ will be typeset in horizontal 
mode. Blank spaces after the & are ignored. There can be any number of cases, 
but there usually are at least two. Each case should be followed by \cr. Notice 
that the \cases construction typesets its own there is no corresponding ‘}’. 


1/3 

2/3 

0 


if 0 < x < 1; 
if 3 < x < 4; 
elsewhere. 


bigl 

bigm 

bigr 

Big 

bigg 

Bigg 

mode 

selection, see cases 
alternatives, see cases 
choices, see cases 
cases 

ampersand 

cr 


► EXERCISE 18.23 

Typeset the display f(x) 
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f You can insert ‘\noalign{ (vertical mode material)}’ just, after any \cr within 
\cases, as explained in Chapter 22, because \cases is an application of the 
general alignment, constructions considered in that chapter. For example, the command 
‘\noalign{\vskip2pt}’ can be used to put a little extra space between two of the cases. 

f Horizontal braces will be set over or under parts of a displayed formula if you 
use the control sequences \overbrace or \underbrace. Such constructions are 
considered to be large operators like \sum, so you can put limits above them or below 
them by specifying superscripts or subscripts, as in the following examples: 


$$\overbrace{x+\cdots+x}"{k\rm\ ; times}$$ 


x + ■ ■ ■ + x 


noalign 

Horizontal braces 

overbrace 

underbrace 

array 

matrix 

matrix 

lambda 

pmatrix 

lgroup 

rgroup 


$$\underbrace{x+y+z}_{>\ , 0} . $$ x y z . 

> o 

8. Matrices. Now comes the fun part. Mathematicians in many different disci- 
plines like to construct rectangular arrays of formulas that have been arranged 
in rows and columns; such an array is called a matrix. Plain TgX provides a. 
\matrix control sequence that makes it convenient to deal with the most com- 
mon types of matrices. 

For example, suppose that you want to specify the display 

(x - A 1 0 \ 

A = | 0 x — X 1 . 

\ 0 0 x — X J 

All you do is type 

$$A=\left (\matrix{x-\lambda&l&0\cr 
0&x-\lambda&l\cr 
0&0&x-\lambda\cr}\right) . $$ 

This is very much like the \cases construction we looked at earlier; each row of 
the matrix is followed by \cr, and signs are used between the individual entries 
of each row. Notice, however, that you are supposed to put your own \left and 
\right delimiters around the matrix; this makes \matrix different from \cases, 
which inserts a. big automatically. The reason is that \cases always involves 
a. left brace, but different delimiters are used in different matrix constructions. 
On the other hand, parentheses are used more often than other delimiters, so 
you can write \pmatrix if you want plain TgjX to fill in the parentheses for you; 
the example above then reduces to 


$$A=\pmatrix{x-\lambda&. . . &x-\lambda\cr} . $$ 



► EXERCISE 18.24 

Typeset, the display 





' u 

X ' 

f a. 

b 

c 



Id 

e 

/ J 

V 

y 



, W 



, using \lgroup and \rgroup. 
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f Tlie individual entries of a matrix are normally centered in columns. Each 
column is made as wide as necessary to accommodate the entries it contains, 
and there’s a quad of space between columns. If you want to put something flush 
right in its column, precede it by \hf ill; if you want to put something flush left in its 
column, follow it by \hf ill. 

f Each entry of a matrix is treated separately from the others, and it is typeset 
as a math formula in text style. Thus, for example, if you say \rm in one entry, 
it does not affect the others. Don’t try to say ‘{\rm x&y}’. 


Matrices often appear in the form of generic patterns that use ellipses 
(i.e., dots) to indicate rows or columns that are left out. You can typeset such 
matrices by putting the ellipses into rows and/or columns of their own. Plain 
T^X provides \vdots (vertical dots) and \ddots (diagonal dots) as companions 
to \ldots for constructions like this. For example, the generic matrix 


Oil 

0'12 

• • Q>ln \ 

021 

«22 

• • &2n 

Oml 

Om2 

• • Q"mn J 


is easily specified: 

$$A=\pmatrix{a_{ll}&a_{12}&\ldots&a_{ln}\cr 

a_{21}&a_{22}&\ldots&a_{2n}\cr 

\vdots&\vdots&\ddots&\vdots\cr 

a_{ml}&a_{m2}&\ldots&a_{mn}\cr}$$ 


quad 

flush right 
hfill 

flush left 

ellipses 

vdots 

ddots 

ldots 

generic matrix 
column vector 
vector 

bordermatrix 


► EXERCISE 18.25 

How can you get TgX to produce the column vector 



? 


Sometimes a matrix is bordered at the top and left 
labels to the rows and columns. Plain TjjX provides 
\bordermatrix for this situation. For example, the display 



by formulas that give 
a special macro called 


C I C' 

<7/1 0 0 \ 

M = I lb 1-6 0 

C \ 0 a l- a) 

is obtained when you type 

$$M=\bordermatr ixf&C&I&C ’ \cr 

C&l&0&0\cr l&b&l-b&0\cr C’&0&a&l-a\cr}$$ 


The first row gives the upper labels, which appear above the big left and right paren- 
theses; the first column gives the left labels, which are typeset flush left, just before 
the matrix itself. The first column in the first row is normally blank. Notice that 
\bordermatrix inserts its own parentheses, like \pmatrix does. 
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f it’s usually inadvisable to put matrices into the text of a paragraph, be- 
cause they are so big that they are better displayed. But occasionally you 
may want to specify a small matrix like (“), which you can typeset for example as 
l $l\,l\chooseO\,l$’. Similarly, the small matrix (" ^ “) can be typeset as 

$\bigl ({a\atop l}{b\atop m}{c\atop n}\bigr)$ 

The \matrix macro does not produce small arrays of this sort. 

9. Vertical spacing. If you want to tidy up an unusual formula, you know already 
how to move things farther apart or closer together, by using positive or negative 
thin spaces. But such spaces affect only the horizontal dimension; what if you 
want something to be moved higher or lower? That’s an advanced topic. 

f Appendix B provides a few macros that can be used to fool IjrX into thinking 
that certain formulas are larger or smaller than they really are; such tricks can 
be used to move other parts of the formula, up or down or left or right. For example, we 
have already discussed the use of \mathstrut in Chapter 16 and \strut in Chapter 17; 
these invisible boxes caused TpX to put square root signs and the denominators of 
continued fractions into different positions than usual. 

f lf you say ‘\phantom{ {subformula)}’ in any formula, TjjX will do all of its 
spacing as if you had said simply ‘{(subformula)}’, but the subformula itself 
will be invisible. Thus, for example, ‘\phantom{0}2’ takes up just as much space as 
‘02’ in the current style, but only the 2 will actually appear on the page. If you want 
to leave blank space for a new symbol that has exactly the same size as but if you 
are forced to put that symbol in by hand for some reason, ‘\mathop{\phantom\sum}’ 
will leave exactly the right amount of blank space. (The ‘\mathop’ here makes this 
phantom behave like \sum, i.e., as a large operator.) 

f Even more useful than \phantom is \vphantom, which makes an invisible box 
whose height and depth are the same as those of the corresponding \phantom, 
but the width is zero. Thus, \vphantom makes a vertical strut that can increase a 
formula’s effective height or depth. Plain defines \mathstrut to be an abbreviation 
for ‘\vphantom(’. There’s also \hphantom, which has the width of a \phantom, but its 
height and depth are zero. 

f Plain TJtX also provides ‘\smash{ (subformula)}’, a macro that yields the same 
result as ‘{(subformula)}’ but makes the height and depth zero. By using 
both \smash and \vphantom you can typeset any subformula and give it any desired 
nonnegative height and depth. For example, 

\mathop{\ smash\limsup\vphantom\l iminf } 

produces a large operator that says ‘limsup’, but its height and depth are those of 
\liminf (i.e., the depth is zero). 

EXERCISE 18.26 

If you want to underline some text, you could use a macro like 
\def\undertext#l{$\underline{\hbox{#l}}$} 


choose 

matrix, small 
atop 

mathstrut 

strut 

phantom 

new symbol 

mathop 

vphantom 

strut 

hphantom 

smash 

liminf 

limsup 

underlined text 


to do the job. But this doesn’t always work right. Discuss better alternatives. 
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/gL/gL You can also use \raise and Mower to adjust the vertical positions of boxes 
X X in formulas. For example, the formula ‘$2“{\raiselpt\hbox{$\scriptstyle 
n$}}$’ will have its superscript n one point higher than usual (2" instead of 2" ). Note 
that it was necessary to say \scriptstyle in this example, since the contents of an 
\hbox will normally be in text style even when that lrbox appears in a superscript, and 
since \raise can be used only in connection with a box. This method of positioning 
is not used extremely often, but it is sometimes helpful if the \root macro doesn’t put 
its argument in a suitable place. For example, 

\root\raise(dimen)\hbox{$\scriptscriptstyle(argument)$}\of . . . 
will move the argument up by a given amount. 

/gK/gK Instead of changing the sizes of subformulas, or using \raise, you can also 
JL JL control vertical spacing by changing the parameters that TjrjX uses when it is 
converting math lists to horizontal lists. These parameters are described in Appen- 
dix G; you need to be careful when changing them, because such changes are global 
(i.e., not local to groups). Here is an example of how such a change might be made: 
Suppose that you are designing a format for chemical typesetting, and that you expect 
to be setting a lot of formulas like ‘Fe^Ci^O.j’. You may not like the fact that the 
subscript in Fe^ 2 is lower than the subscript in Cr 2 ; and you don’t want to force users 
to type monstrosities like 


$\rm Fe_2~{+2}Cr_2~{\vphantom{+2}}0_4“{\vphantom{+2}}$ 

just to get the formula FeJ 2 Cr 2 0 4 with all subscripts at the same level. Well, all 
you need to do is set ‘\fontdimenl6\tensy=2 . 7pt’ and l \fontdimenl7\tensy=2 . 7pt’, 
assuming that \tensy is your main symbol font (\textfont2); this lowers all normal 
subscripts to a position 2.7 pt below the baseline, which is enough to make room for a 
possible superscript that contains a plus sign. Similarly, you can adjust the positioning 
of superscripts by changing \f ontdimenl4\tensy. There are parameters for the position 
of the axis line, the positions of numerator and denominator in a generalized fraction, 
the spacing above and below limits, the default rule thickness, and so on. Appendix G 
gives precise details. 


10. Special features for math hackers. TgX has a few more primitive operations 
for math mode that haven’t been mentioned yet. They are occasionally useful if 
you are designing special formats. 


/gL/gL If a glue or kern specification is immediately preceded by ‘\nonscript’, TjgX 
JL JL will not use that glue or kern in script or scriptscript styles. Thus, for example, 
the sequence ‘\nonscript\ ; ’ produces exactly the amount of space specified by ‘(3)’ 
in the spacing table for mathematics that appeared earlier in this chapter. 


/gK/gK Whenever TjjX has scanned a $ and is about to read a math formula that 
JL jl appears in text, it will first read another list of tokens that has been predefined 
by the command \everymath={ (token list)}. (This is analogous to \everypar, which 
was described in Chapter 14.) Similarly, you can say \everydisplay={ (token list)} to 
predefine a list of tokens for TgX to read just after it has scanned an opening $$, i.e., just 
before reading a formula that is to be displayed. With \everymath and \everydi splay, 
you can set up special conventions that you wish to apply to all formulas. 
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raise 

lower 

scriptstyle 

hbox 

root 

global 

chemical typesetting 

fontdimen 

tensy 

subscripts 

superscripts 

axis line 

numerator 

denominator 

fraction 

limits 

rule thickness 
nonscript 
everymath 
every display 
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11. Summary. We have discussed more different kinds of formulas in this chapter 
than you will usually find in any one book of mathematics. If you have faithfully 
done the exercises so far, you can face almost any formula with confidence. 

f But here are a few more exercises, to help you review what you have learned. 

Each of the following “challenge formulas” illustrates one or more of the prin- 
ciples already discussed in this chapter. The author confesses that he is trying to trip 
you up on several of these. Nevertheless, if you try each one before looking at the 
answer, and if you’re alert for traps, you should find that these formulas provide a 
good way to consolidate and complete your knowledge. 

►EXERCISE 18.27 

JL Challenge number 1: Explain how to type the phrase ‘n th root’, where ‘n th ’ 
is treated as a mathematical formula with a superscript in roman type. 

S -1 TS = dg(wi, . . . ,w„) = A. 

Pr( m = n \ m + n = 3 ). 
sin 18° = l(v / 5 — 1). 
k = 1.38 x 10 -16 erg/°K. 

<§ C NLl/N = L\ C • • • C NL*JN = L* n . 

= ffD9(x,y) e ‘ Xh(x,v) dxdy. 

fo ■ ■ ■ Jo f(' x 1 x n) dx 1 . . . d,x n . 


<2 

<t> 

<t> 

<^> 

<t> 

<^> 

<^> 


► EXERCISE 18.28 

Challenge number 2: 

► EXERCISE 18.29 

Challenge number 3: 

► EXERCISE 18.30 

Challenge number 4: 

► EXERCISE 18.31 

Challenge number 5: 

► EXERCISE 18.32 

Challenge number 6: 

► EXERCISE 18.33 

Challenge number 7: 

► EXERCISE 18.34 

Challenge number 8: 


► EXERCISE 18.35 

Challenge number 9: Here’s a display. 

r = /Q(A^-AH^)-25 2 
2m l p H x 'm ~ PiWfJ - 25 2 

► EXERCISE 18.36 

Challenge number 10: And another. 


(m odd) 
(m even) 


(mod N). 


(1 + X\Z + x lz +•••)... (1 + X n Z + X “2 4 ) = 

►EXERCISE 18.37 

JL Challenge number 11: And another. 


(1 - Xiz) ... (1 - x n z) ' 


II ( H " d"-' 

j > 0 ^k>0 


— ^ ^ z ( 'y ] a oio a ifci 

n>0 \ ko,ki,...>0 


ko-\-ki-\---=n 


degrees 
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►EXERCISE 18.38 

JL Challenge number 12: And, 

(ni + n 2 + • • • + n m )\ _ ( n \ + n 2 \ ( n \ + n - 2 + n 3 

no 


ni \ n 2 \ ■ .. n m \ 


n 3 


ni +n o + • • • + n ,- 

Orn 


► EXERCISE 18.39 

Challenge number 13: Yet another display. 


n R 


ai , a-2 , . . . , AM 

bi , &2 , ■ ■ ■ , b]y . 


n 


(1 _ g ai + ")(l — g“2 + n^ ^ _ qCiM+n ) 


11 (X — g f ’ 1 + ”)(l — g&2+») (1 _ qbN + n'j 


► EXERCISE 18.40 

Challenge number 14: And another. 


f ( P )= / f ( t ) dn ( t ). 


p prime 


/t>l 


/£> ►EXERCISE 18.41 

JL Challenge number 15: Still another. 

k a’ s 1 6’s 

-V— — ' 

&+/ elements 

/gK ►EXERCISE 18.42 

JL Challenge number 16: Put a \smallskip between the rows of matrices in the 
compound matrix 

f( ° h \ ( 6 ()\ 

Q h 




C (l: 

0 


l J 

k llj 


/£> ►EXERCISE 18.43 

JL Challenge number 17: Make the columns flush left here. 


> 0. 


<£)<£>► EXERCISE 18.44 

Challenge number 18: The main problem here is to prime the y\ 



co 

Cl 

Co 

• • 


Cl 

C2 

C3 

• • ^n+ 1 

det 

C-2 

C3 

C'4 

• • C n +2 


C n 

Cn-f- 1 

Cji+2 . 

• • C2n 


£'/(*)=£/(*)■ 


xeA 


xeA 

x^O 


smallskip 
compound matrix 
flush left 
sum prime 
=def 
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► EXERCISE 18.45 

Challenge number 19: 


You may be ready now for this display. 


2 tt k = f 2 2 ' 


k 


^►EXERCISE 18.46 

Challenge number 20: And finally, when you have polished off all the other 
examples, here’s the ultimate test. Explain how to obtain the commutative diagram 


commutative diagram 
matrix 

author, typesetting by 


o 


O c 


Oc — t tt*Od 


C 


<P 


R l f*O v (-D) — > 
R 1 f,{O v (-iM )) O 7 -1 


0 


using \matrix. (Many of the entries are blank.) 

12. Words of advice. The number of different notations is enormous and still 
growing, so you will probably continue to find new challenges as you continue to 
type mathematical papers. It’s a good idea to keep a personal notebook in which 
you record all of the non-obvious formulas that you have handled successfully, 
showing both the final output and what you typed to get it. Then you’ll be able 
to refer back to those solutions when you discover that you need to do something 
similar, a few months later. 

If you’re a mathematician who types your own papers, you have now 
learned how to get enormously complex formulas into print, and you can do so 
without going through an intermediary who may somehow distort their meaning. 
But please, don’t get too carried away by your newfound talent; the fact that you 
are able to typeset your formulas with Tp;X doesn’t necessarily mean that you 
have found the best notation for communicating with the readers of your work. 
Some notations will be unfortunate even when they are beautifully formatted. 
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GOETHE 

HALMOS 


Mathematicians are like Frenchmen: 
whenever you say something to them, they translate it into their own language, 

and at once it is something entirely different. 
— GOETHE, Maxims and Reflexions (1829) 

The best notation is no notation; 
whenever it is possible to avoid the use of a complicated alphabetic apparatus, 

avoid it. 

A good attitude to the preparation of written mathematical exposition 

is to pretend that it is spoken. 
Pretend that you are explaining the subject to a friend 
on a long walk in the woods, with no paper available; 
fall back on symbolism only when it is really necessary. 

— PAUL HALMOS, How to Write Mathematics (1970) 
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By now you know how to type mathematical formulas so that T^X will handle 
them with supreme elegance; your knowledge of math typing is nearly complete. 
But there is one more part to the story, and the purpose of this chapter is to 
present the happy ending. We have discussed how to deal with individual formu- 
las; but displays often involve a whole bunch of different formulas, or different 
pieces of a huge formula, and it’s a bit of a problem to lay them out so that they 
line up properly with each other. Fortunately, large displays generally fall into 
a few simple patterns. 

1. One-line displays. Before plunging into the general question of display layout, 
let’s recapitulate what we have already covered. If you type ‘$$ (formula) $$’, 
TgX will display the formula in flamboyant display style, centering it on a line 
by itself. We have also noted in Chapter 18 that it’s possible to display two 
short formulas at once, by typing ‘$$(formulai)\qquad(formulao)$$’; this reduces 
the two-formula problem to a one-formula problem. You get the two formulas 
separated by two quads of space, the whole being centered on a line. 

Displayed equations often involve ordinary text. Chapter 18 explains 
how to get roman type into formulas without leaving math mode, but the best 
way to get text into a display is to put it into an \hbox. There needn’t even be 
any math at all; to typeset 

Displayed Text 

you can simply say ‘$$\hbox{Displayed Text}$$’. But here’s a more interesting 
example: 

X n = X k if and only if Y n = Y k and Z n = Z k . 

Formulas and text were combined in this case by typing 

$$X_n=X_k \qquad\hbox{if and only if}\qquad 
Y_n=Y_k \quad\hbox{and}\quad Z_n=Z_k.$$ 

Notice that \qquad appears around ‘if and only if’, but a single \quad surrounds 
‘and’; this helps to indicate that the Y and Z parts of the display are related 
more closely to each other than to the X part. 

Consider now the display 

Y n = X n mod p and Z n = X n mod q for all n > 0. 

Can you figure out how to type this? One solution is 

$$Y_n=X_n\bmod p \quad\hbox{and}\quad Z_n=X_n\bmod q 
\qquad\hbox{f or all }n\geO.$$ 

Notice that a space has been left after ‘all’ in the hbox here, since spaces 
disappear when they are out in formula-land. But there’s a simpler and more 
logical way to proceed, once you get used to Tf^X’s idea of modes: You can type 

... \qquad\hbox{f or all $n\geO$.}$$ 


displays 

dollardollar 

qquad 

hbox 

quad 
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Wow — that’s math mode inside of horizontal mode inside of display math mode. 
But in this way your manuscript mirrors what you are trying to accomplish, 
while the previous solution (with the space after ‘all’) looks somewhat forced. 


► EXERCISE 19.1 

Typeset the following four displays (one at a time): 


£< 

n=0 


converges if |^| < (limsup \/\a r , 

' n — Vor, 


f{x + Ax) - f(x) 

Ax 

llu<ll = 1. 


— > f'(x ) as Ax — » 0. 

U j ■ Uj =0 if i f j. 


j 

j* an arc 'j 

| j 

j* an arc '| 

The confluent image of < 

a circle 


an arc or a circle > 

1 

{a fan J 

1 1 

[ a fan or an arc J 


/gK ►EXERCISE 19.2 

JL Sometimes display style is too grandiose, when the formula being displayed is 

1 

y = —x 
y 2 

or something equally simple. One day B. L. User tried to remedy this by typing it as 
‘$$y={\scriptstylel\over\scriptstyle2}x$$’, but the resulting formula 

1 

y = -x 

2 

wasn’t at all what he had in mind. What’s the right way to get simply ‘y = |a:’ when 
you don’t want big fractions in displays? 

►EXERCISE 19.3 

JL What difference, if any, is there between the result of typing ‘$$ (formula) $$’ 
and the result of typing ‘$$\hbox{$(formula)$}$$’ ? 

►EXERCISE 19.4 

JL You may have noticed that most of the displays in this manual are not cen- 
tered; displayed material is usually aligned at the left with the paragraph indentation, 
as part of the book design, because this is an unusual book. Explain how you could 
typeset a formula like 

.-H-L 

that is off-center in this way. 


If you’ve had previous experience typing mathematical papers, you prob- 
ably have been thinking, “What about equation numbers? When is this book 
going to talk about them?” Ah yes, now is the time to discuss those sneaky little 
labels that appear off to the side of displays. If you type 


$ $ (formula) \ eqno (formula) $ $ 


Delta 
Vert 
User 
one half 

1/2-unslashed form 
fractions in displays 
displays, non-centered 
equation numbers 
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TgX will display the first formula and it will also put an equation number (the 
second formula) at the right-hand margin. For example, 

$$x~2-y~2 = (x+y) (x-y) .\eqno(15)$$ 

will produce this: 

x 2 - y 2 = ( x + y) (x-y). (15) 

You can also get equation numbers at the left-hand margin, with \leqno. For 
example, 

$$x~2-y~2 = (x+y) (x-y) . \leqno(16)$$ 


will produce this: 


- V 2 = (x + y)(x - y). 


Notice that you always give the equation number second, even when it is going 
to appear at the left. Everything from the \eqno or \leqno command to the $$ 
that ends the display is the equation number. Thus, you’re not allowed to have 
two equation numbers in the same display; but there’s a way to get around that 
restriction, as we’ll see later. 

f Nowadays people are using right-hand equation numbers more and more, be- 
cause a display most often comes at the end of a sentence or clause, and the 
right-hand convention keeps the number from intruding into the clause. Furthermore, 
it’s often possible to save space when a displayed equation follows a short text line, 
since less space is needed above the display; such savings are not possible with \leqno, 
because there’s no room for overlap. For example, there is less space above display (15) 
than there is above (16) in our illustrations of \eqno and \leqno, although the formulas 
and text are otherwise identical. 

f lf you look closely at (15) and (16) above, you can see that the displayed 
formulas have been centered without regard to the presence of the equation 
numbers. But when a formula is large, T^X makes sure that it does not interfere with 
its number; the equation number may even be placed on a line by itself. 

► EXERCISE 19.5 

How would you produce the following display? 


(1 — q k z) 


n a-*/*)- 


►EXERCISE 19.6 

JL Equation numbers are math formulas, typeset in text style. So how can you 
get an equation number like ‘(3-1)’ (with an on dash)? 

<§)(£)*■ EXERCISE 19.7 

JL JL B. L. User tried typing l \eqno(*)’ and ‘\eqno (**)’, and he was pleased to 
discover that this produced the equation numbers “(*)’ and “(**)’. [He had been a bit 
worried that they would come out “(*)’ and “(**)’ instead.] But then a few months 
later he tried ‘\eqno (***)’ and got a surprise. What was it? 


eqno 

leqno 

en-dash 

User 
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Somewhere in this manual there ought to be a description of exactly how 
JL JL T].\T displays formulas; i.e. , how it centers them, how it places the equation 
numbers, how it inserts extra space above and below, and so on. Well, now is the time 
for those rules to be stated. They are somewhat complex, because they interact with 
things like \par shape, and because they involve several parameters that haven’t been 
discussed yet. The purpose of the rules is to explain exactly what sorts of boxes, glue, 
and penalties are placed onto the current vertical list when a display occurs. 


/>>/>) If a display occurs after, say, four lines of a paragraph, TgX’s internal register 
JL JL called \prevgraf will be equal to 4 when the display starts. The display will be 
assumed to take three lines, so \prevgraf will become 7 when the paragraph is resumed 
at the end of the display (unless you have changed \prevgraf in the meantime). TgX 
assigns special values to three (dimen) parameters immediately after the opening $$ is 
sensed: \displaywidth and \displayindent are set to the line width z and the shift 
amount s for line number \prevgraf + 2, based on the current paragraph shape or hang- 
ing indentation. (Usually \displaywidth is the same as \hsize, and \displayindent 
is zero, but the paragraph shape can vary as described in Chapter 14.) Furthermore, 
\predisplaysize is set to the effective width p of the line preceding the display, as 
follows: If there was no previous line (e.g., if the $$ was preceded by \noindent or by 
the closing $$ of another display), p is set to —16383.99999 pt (i.e., to the smallest legal 
dimension, — \maxdimen). Otherwise T)LX looks inside the hbox that was formed by the 
previous line, and sets p to the position of the right edge of the rightmost box inside 
that hbox, plus the indentation by which the enclosing hbox has been moved right, plus 
two ems in the current font. However, if this value of p depends on the fact that glue in 
that hbox was stretching or shrinking — for example, if the \parfillskip glue is finite, 
so that the material preceding it has not been set at its natural width — then p is set to 
\maxdimen. (This doesn’t happen often, but it keeps l^X machine independent, since 
p never depends on quantities that may be rounded differently on different computers. ) 
Notice that \displaywidth and \displayindent are not affected by \leftskip and 
\rightskip, but \predisplaysize is. The values of \displaywidth, \displayindent, 
and \predisplaysize will be used by TJzX after the displayed formula has been read, 
as explained below; your program can examine them and/or change them, if you want 
the typesetting to be done differently. 


/>>/>) After a display has been read, T^X converts it from a math list to a horizontal 
JL JL list h in display style, as explained in Appendix G. An equation number, if 
present, is processed in text style and put into an hbox a with its natural width. Now 
the fussy processing begins: Let z, s, and p be the current values of \displaywidth, 
\displayindent, and \predisplaysize. Let q and e be zero if there is no equation 
number; otherwise let e be the width of the equation number, and let q be equal to 
e plus one quad in the symbols font (i.e., in \textfont2). Let w o be the natural width 
of the displayed formula h. If wo + q < z, list h is packaged in an hbox b having its 
natural width wq. But if ico + q > z (i.e., if the display is too wide to fit. at its natural 
width), T)?X performs the following “squeeze routine”: If e ^ 0 and if there is enough 
shrinkability in the displayed formula h to reduce its width to z — q, then list h is 
packaged in an hbox b of width z — q. Otherwise e is set to zero, and list h is packaged 
in a (possibly overfull) hbox b of width min(ico, z). 



(Continuation.) TgX tries now to center the display without regard to the 
equation number. But if such centering would make it too close to that number 


vertical list 

prevgraf 

displaywidth 

displayindent 

hsize 

predisplaysize 

noindent 

maxdimen 

parfillskip 

textfont 

squeeze routine 
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(where “too close” means that the space between them is less than the width e), the 
equation is either centered in the remaining space or placed as far from the equation 
number as possible. The latter alternative is chosen only if the first item on list h is 
glue, since IjrX assumes that such glue was placed there in order to control the spacing 
precisely. But let’s state the rules more formally: Let w be the width of box b. TjrX 
computes a displacement d , to be used later when positioning box 6, by first setting 
d = j(z — w). If e > 0 and if d < 2e, then d is reset to ^(z — w — e) or to zero, where 
zero is chosen if list h begins with a glue item. 

/gK/gK (Continuation.) TgX is now ready to put things onto the current vertical list, 
JL JL just after the material previously constructed for the paragraph-so-far. First 
comes a penalty item, whose cost is an integer parameter called \predisplaypenalty. 
Then conies glue. If d + s < p, or if there was a left equation number (\leqno), 
TjgX sets g a and gt to glue items specified by the parameters \abovedisplayskip and 
\belowdisplayskip, respectively: otherwise g a and gb become glue items correspond- 
ing to \abovedisplayshortskip and \belowdisplayshortskip. [Translation: If the 
predisplaysize is short enough so that it doesn’t overlap the displayed formula, the glue 
above and below the display will be “short” by comparison with the glue that is used 
when there is an overlap.] If e = 0 and if there is an \leqno, the equation number is 
appended as an hbox by itself, shifted right s and preceded by interline glue as usual; 
an infinite penalty is also appended, to prevent a page break between this number and 
the display. Otherwise a glue item g a is placed on the vertical list. 

/gL/gL (Continuation.) Now conies the displayed equation itself. If e ^ 0, the 
JL JL equation number box a is combined with the formula box b as follows: Let k 
be a kern of width z — w — e — d. In the \eqno case, box b is replaced by an hbox 
containing ( b , k, a); in the \leqno case, box b is replaced by an hbox containing (a, k, b), 
and d is set to zero. In all cases, box b is then appended to the vertical list, shifted 
right by s + d. 

/gK/gK (Continuation.) The final task is to append the glue or the equation number 
JL JL that follows the display. If there was an \eqno and if e = 0, an infinite 
penalty is placed on the vertical list, followed by the equation number box a shifted 
right by s + z minus its width, followed by a penalty item whose cost is the value 
of \postdisplaypenalty. Otherwise a penalty item for the \postdisplaypenalty is 
appended first, followed by a glue item for gt as specified above. IjjX now adds 3 to 
\prevgraf and returns to horizontal mode, ready to resume the paragraph. 

/gL/gL One consequence of these rules is that you can force an equation number to 
JL JL appear on a line by itself by making its width zero, i.e., by saying either 
‘\eqno\llap{$(formula)$}’ or ‘\leqno\rlap{$(formula)$}’. This makes e = 0, and the 
condition e = 0 controls TjrX’s positioning logic, as explained in the rules just given. 

/gL/gL Plain TjrX sets \predisplaypenalty=10000, because fine printers tradition- 
JL JL ally shun displayed formulas at the very top of a page. You can change 
\predisplaypenalty and \postdisplaypenalty if you want to encourage or discourage 
page breaks just before or just after a display. For example, ‘$$\postdisplaypenalty= 
-10000{formula)$$’ will force a page break, putting the formula at the bottom line. It 
is better to force a page break this way than to say \eject right after $$. . .$$; such 
an eject (which follows the \belowdisplayskip glue below the display) causes the page 
to be short, because it leaves unwanted glue at the bottom. 


penalty 

predisplaypenalty 

abovedisplay skip 

belowdisplayskip 

abovedisplayshortskip 

belowdisplayshortskip 

postdisplaypenalty 

llap 

rlap 

page break 
eject 
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/>>/>)► EXERCISE 19.8 

JL JL Read the rules carefully and deduce the final position of ‘x = y' in the formula 

$$\quad x=y \hskipl0000pt minus lfil \eqno(5)$$ 

assuming that there is no hanging indentation. Also consider \leqno instead of \eqno. 

/gK/gK TgX also allows “alignment displays,” which are not processed in math mode 
JL JL because they contain no formulas at the outer level. An alignment display is 
created by commands of the general form 

$$ (assignments) \halign{ (alignment) } (assignments) $$ 

where the (assignments) are optional things like parameter changes that do not produce 
any math lists. In such displays, the \halign is processed exactly as if it had appeared 
in vertical mode, and it will construct a vertical list v as usual, except that each row 
of the alignment will be shifted right by the \displayindent. After the alignment and 
the closing assignments have been processed, TpX will put a \predisplaypenalty item 
and some \abovedisplayskip glue on the main vertical list, followed by v, followed by 
a \postdisplaypenalty item and \belowdisplayskip glue. Thus, alignment displays 
are essentially like ordinary alignments, except that they can interrupt paragraphs; 
furthermore, they are embedded in glue and penalties just like other displays. The 
\displaywidth and \predisplaysize do not affect the result, although you could use 
those parameters in your \halign. An entire alignment display is considered to be only 
three lines long, as far as \prevgraf is concerned. 

2. Multi-line displays. OK, the use of displayed formulas is very nice. But when 
you try typing a lot of manuscripts you will run into some displays that don’t 
fit the simple pattern of a one-line formula with or without an equation num- 
ber. Plain T^X provides special control sequences that will cover most of the 
remaining cases. 

Multi-line displays usually consist of several equations that should be 
lined up by their ‘=’ signs, as in 

Ad + • • • + X p = m, 

Yi + • • • + Yq = 11. 

The recommended procedure for such a display is to use \eqalign, which works 
with special markers & and \cr that we have already encountered in connection 
with \cases and \matrix in Chapter 18. Here’s how to type this particular one: 

$$\eqalign{X_l+\cdots+X_p&=m, \cr 

Y_l+\cdots+Y_q&=n. \cr}$$ 

There can be any number of equations in an \eqalign; the general pattern is 

\eqalign{(left-hand sidei)&(right-hand sidei)\cr 
(left-hand sideo)& (right-hand sideo)\cr 


alignment displays 

displayindent 

predisplaypenalty 

abovedisplay skip 

postdisplaypenalty 

belowdisplayskip 

displaywidth 

predisplaysize 

halign 

prevgraf 

eqalign 

ampersand 

cr 


(left-hand side n ,)&(right-hand side„)\cr} 
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where each (right-hand side) starts with the symbol on which you want alignment 
to occur. For example, every right-hand side often begins with an = sign. The 
equations will be typeset in display style. 


► EXERCISE 19.9 

In practice, the left-hand sides of aligned formulas are often blank, and the 
alignment is often done with respect to other symbols as well as =. For example, 
the following display is typical; see if you can guess how the author typed it: 

T(n) < T( 2^"1) < c(3 rig nl -2^ g "l) 

< 3c • 3 lg ” 

= 3cn lg3 . 


The result of \eqalign is a vertically centered box. This makes it easy 
to get a formula like 


f a = f(z) 1 
P = f{z 2 ) 

{ 7 = f(z 3 ) } 


x = a 2 — 0 1 

y = 27 j 


You simply use \eqalign twice in the same line: 


$$\left\{ 

\eqalign{\alpha&=f (z) \cr \beta&=f (z~2) \cr \gamma&=f (z~3) \cr} 
\right\}\qquad\left\{ 

\eqalign{x&=\alpha~2-\beta\cr y&=2\gamma\cr}\right\} . $$ 


► EXERCISE 19.10 

Try your hand at the numbered two-line display 


P(x) = a,Q + ci\x + cinx 2 + ■ ■ ■ + a n x n , 
P(—x) = ao — a.\x + aox 2 — ■ ■ ■ + ( — 1 ) n d n x n . 


(30) 


[Hint; Use the fact that \eqalign produces a vertically centered box; the equa- 
tion number ‘(30)’ is supposed to appear halfway between the two lines.] 


► EXERCISE 19.11 

What happens if you forget the & in one equation of an \eqalign? 

f Multi-line formulas sometimes fit together in odd ways, and you’ll find that 
every once in a while you will want to move certain lines farther apart or 
closer together. If you type ‘\noalign{\vskip(glue)}’ after any \cr, TjgX will insert 
the given amount of extra glue just, after that particular line. For example, 

\noalign{\vskip3pt} 


noalign 

vskip 


will put 3 pt of additional space between lines. You can also change the amount of 
space before the first line, in the same way. 



192 Chapter 19: Displayed Equations 


The next level of complexity occurs when you have several aligned equa- 
tions with several equation numbers. Or perhaps some of the lines are numbered 
and others are not: 


{x + y)(x - y) = x 2 - x y + yx - y 2 

= x 2 - y 2 ; (4) 

(x + y) 2 = x 2 + 2 xy + y 2 . (5) 

For this situation plain TgX provides \eqalignno; you use it like \eqalign, but 
on each line that you want an equation number you add ‘&{equation number)’ 
just before the \cr. The example above was generated by 

$$\eqalignno{ (x+y) (x-y)&=x~2-xy+yx-y~2\cr 
&=x~2-y~2;&(4)\cr 
(x+y) ~2&=x~2+2xy+y~2 .& (5) \cr}$$ 


Notice that the second & is omitted unless there’s an equation number. 

And there’s also \leqalignno, which puts equation numbers at the left. 
In this case it is appropriate to move the ‘(4)’ to the beginning of its equation: 

(4) (x + y)(x - y) = x 2 - xy + yx - y 2 

2 2 
= x -y ; 

(5) (x + y) 2 = x 2 + 2 xy + y 2 . 

Although the equation numbers appear at the left, you are still supposed to input 
them at the right, just as you do with \leqno; in other words, you should type 
•$$\leqalignno{(x+y) (x-y )&. . .&(4)\cr . . .}$$’ to get the previous display. 

Caution: \eqalignno and \leqalignno both center the set of equations 
without regard to the widths of the equation numbers. If the equations or their 
numbers get too wide, they might overlap, yet no error message will be given. 


► EXERCISE 19.12 

Typeset the following display: 


( 9 ) 

( 10 ) 


gcd (u,v) = gcd (v,u); 
gcd (u,v) = gcd (—u,v). 


► EXERCISE 19.13 

And here’s another one to try, just to keep in practice: 

/ POO \ 2 poo pc <5 

(J e J dxj = J J c { * ' '' ' dx (I y 


—co J — oo 
27T poo 


0 Jo 


r27r / e~ r 


JO 
— 7 r. 


r=0 


d,0 


( 11 ) 


eqalignno 

leqalignno 

gcd 

int 
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f Although \eqalign and \eqalignno look nearly the same, there’s really a 
fundamental distinction between them: \eqalign makes a single, vertically 
centered box, which is no wider than it needs to be; but \eqalignno generates a set of 
lines that have the full display width (reaching all the way to both margins). Thus, for 
example, you can use \eqalign several times in a display, but \eqalignno can appear 
only once. If you try to use \eqno in conjunction with \eqalign, you get a decent 
result, but if you try to use \eqno in connection with \eqalignno you’ll get some sort 
of weird error message(s). 

/>)/>) The definitions in Appendix B reveal why \eqalign and \eqalignno be- 
JL JL have differently: \eqalign is an abbreviation for \vcenter{\halign{ . . . 
while \eqalignno is an abbreviation for \halign to\displaywidth{ . . . }; thus the 
\eqalignno macro generates an “alignment, display.” 

/>)/>) This difference between \eqalign and \eqalignno has two interesting con- 
JL JL sequences. (1) It’s impossible to break an \eqalign between pages, but an 
\eqalignno can be broken. In fact, you can force a page break after a particular line if 
you insert ‘\noalign{\break}’ after the \cr for that line. You can prohibit all breaks 
in an \eqalignno if you set \interdisplaylinepenalty=10000; or you can enclose the 
whole works in a \vbox: 


eqno 

vcenter 

halign 

alignment display 
page break 
noalign 
break 

interdisplay linepenalty 

vbox 

User 


$$\vbox{\eqalignno{ . . .}}$$ 

(2) You can also insert a line of text between two equations, without losing the align- 
ment. For example, consider the two displays 

x = y + z 


2 2.2 
x = y + z . 


These were actually generated as a single display by typing 

$$\eqalignno{x&=y+z\cr 
\noalign{\hbox{and}} 
x"2&=y~2+z"2 . \cr}$$ 

Therefore the fact that their = signs line up is not just a lucky coincidence. Sometimes 
you will want to adjust the spacing above or below such a line of inserted text, by 
putting a \vskip or two inside of the \noalign{. . Incidentally, this example also 
shows that it is possible to use \eqalignno without giving any equation numbers. 

EXERCISE 19.14 

JL JL What happens if \eqalign is substituted for \eqalignno in this last example? 

(£)(£)*■ EXERCISE 19.15 

JL JL Our friend Ben User got into trouble again when he tried to move an equation 
number up higher than its usual position, by typing this: 


$$\eqalignno{ . . . &\raise6pt\hbox{ (5) }\cr}$$ 
What was his oversight, and what could he have done instead? 
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For other types of displays, plain TjjX provides \displaylines, which lets you 
display any number of formulas in any way you want, without any alignment. 
The general form is 



$$\displaylines{ (displayed formulai)\cr 
(displayed formula 2 )\cr 


(displayed formula,, )\cr}$$ 

Each formula will be centered, because \displaylines puts \hfil at the left and the 
right of each line; you can override this centering to get things flush left or flush right 
by inserting \hf ill, which takes precedence over \hfil. 

►EXERCISE 19.16 

JL Use \displaylines to typeset the three-line display 


x = x\ 

if x = y then y = x\ 
if x = y and y = z then x = z. 


( 1 ) 

( 2 ) 

(3) 


f lf you look closely at the multi-line displays in this chapter, you’ll see that the 
baselines are farther apart than they are in normal text; mathematics publish- 
ers generally do this in order to make the displays easier to read. In accordance with 
this tradition, \eqalign and its relatives automatically increase the \baselineskip. 
If you are making a multi-line display with IjrX’s primitive \halign command, instead 
of using one of the plain IgX macros, you might want to make this same baseline ad- 
justment, and you can do it easily by saying ‘$$\openupl\jot \halign{. . .}$$’. The 
\openup macro increases \lineskip and \lineskiplimit as well as \baselineskip. 
If you say ‘\openup2\ jot’, the lines are spread apart 2 extra units, where plain TjrX 
opens things up in units of 3pt. Since $$...$$ acts as a group, the effect of \openup 
will disappear when the display is finished. Any (dimen) can follow \openup, but it’s 
customary to express the amount symbolically in terms of a \jot instead of using 
absolute units; then your manuscript can be used with a variety of different formats. 

/>)/>) Plain TjrX’s \displaylines, \eqalignno, and \leqalignno macros begin with 
JL JL ‘\openupl\jot’. If you don’t want the lines to be opened up, you can cancel 
this by saying, e.g., ‘$$\openup-l\jot \eqalignno{ . . . because \openup has a 
cumulative effect. 

/>)/>) Suppose that you have decided to make a homegrown display having the gen- 
JL JL eral form ‘$$\openupl\ jot \halign{. . .}$$’; and for convenience, let’s sup- 
pose that the normal conventions of plain TjrX are in force, so that \jot=3pt and 
\baselineskip=12pt. Then the \openup macro changes the baselineskip distance to 
15 pt. It follows that the baseline of the text line that immediately precedes the display 
will be 15 pt above the topmost baseline of the display, plus the \abovedisplayskip. 
But when the paragraph resumes, its next baseline will be only 12 pt below the bot- 
tom baseline of the display, plus the \belowdisplayskip, because the \baselineskip 
parameter will have reverted to its normal value. The \eqalignno and \displaylines 
macros say ‘\noalign{\vskip— dY before their first lines, where d is the net amount 
of opening-up, in order to compensate for this difference. 


displaylines 

hfil 

hfill 

baselineskip 

halign 

openup 

lineskip 

lineskiplimit 

group 

jot 

generic coding 

displaylines 

eqalignno 

leqalignno 

abovedisplay skip 

belowdisplayskip 
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3. Long formulas. Our discussion of mathematics typing is almost complete; we break long displayed formulas 
need to deal with just one more problem: What should be done when a formula 
is so long that it doesn’t fit on a single line? 

For example, suppose that you encounter the equation 

<t(2 34 — 1, 2 35 , 1) = — 3+(2 34 — 1)/2 35 +2 3 7(2 34 — 1)+7/2 35 (2 34 — 1)— <t(2 35 ,2 34 — 1,1). 

You’ll have to break it up somehow; has done its best to squeeze everything 
together by shrinking the spaces next to the + and — signs to zero, but still the 
line has come out overfull. 

Let’s try to break that equation just before the ‘+7’. One common way 
to do this is to type 

$$\eqalign{\sigma(2"{34}-l ,2" {35} , 1) 

&=-3+(2~{34}-l)/2~{35}+2~{35}\ ! / (2~{34}-l)\cr 
&\qquad+7/2~{35} (2~{34}-l) -\sigma(2~{35} ,2~{34}-l , 1) . \cr}$$ 

which yields 

<t(2 34 - 1, 2 35 , 1) = -3 + (2 34 - l)/2 35 + 2 35 / (2 34 - 1) 

+ 7/2 35 (2 34 - 1) - <t(2 35 , 2 34 - 1, 1). 

The idea is to treat a long one-line formula as a two-line formula, using \qquad 
on the second line so that the second part of the formula appears well to the 
right of the = sign on the first line. 


► EXERCISE 19.17 

Explain how to deal with the following display. 

x n ui -i 1- x n+t -i U t = x n ui + (ax n + c)u -2 H 

+ (a t ~ 1 x n + c(a t ~' 2 + ■ ■ ■ + 1 ))u t 
= ( Mi + au 2 + • • • + a t ~ 1 u t )x„ + h(ui, . . . ,u t ). (47) 

f it’s quite an art to decide how to break long displayed formulas into several 
lines; TpX never attempts to break them, because no set of rules is really 
adequate. The author of a mathematical manuscript is generally the best judge of 
what to do, since break positions depend on subtle factors of mathematical exposition. 
For example, it is often desirable to emphasize some of the symmetry or other structure 
that underlies a formula, and such things require a solid understanding of exactly what 
is going on in that formula. 



Nevertheless, it is possible to state a few rules of thumb about how to deal 
with long formulas in displays, since there are some principles that the best 


mathematical typesetters tend to follow: 


a) Although formulas within a paragraph always break after binary operations 
and relations, displayed formulas always break before binary operations and relations. 
Thus, we didn’t end the first line of our <r( . . . ) example with ‘ (2"{34}-l)+’; we ended 
it with ‘(2"{34}-l)’ and began the second line with “+’. 
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b) When an equation is broken before a binary operation, the second line should 
start at least two quads to the right of where the innermost subformula containing that 
binary operation begins on the first line. For example, if you wish to break 

$$\sum_{0<k<n}\left ({formulai )+{formula 2 )\right)$$ 

at the plus sign between (formulai) and (formula 2 ), it is almost mandatory to have the 
plus sign on the second line appear somewhat to the right of the large left parenthesis 
that corresponds to ‘\left(’. 

f in the example just considered, special care is needed to break the formula into 
two lines, because \left and \right delimiters cannot be used in isolation; 
you can’t, have only \left in one line of a formula and only \right in the second. 
Furthermore, you’ll want the two delimiters to be of the same size, even though they 
occur in different lines. The best solution is usually to choose the delimiter size yourself; 
for example, you could type 

$$\eqalign{\sum_{0<k<n}\biggl (&{formula.i )\cr 

&\qquad{}+(formula. 2 )\biggr)\cr}$$ 


if \bigg delimiters are best. Notice that the & markers don’t occur at = signs in this 
example, they just mark a point of alignment. 

f There’s another way to break long formulas, sometimes called the two-line 
form. The idea is to put the first part of the formula almost flush left, and to 
put the second part almost flush right, where “almost flush” means “one quad away.” 
Thus, the two-line form of the long <r( . . . ) equation considered earlier is 

<t( 2 34 - 1, 2 35 , 1) = -3 + (2 34 - l)/2 35 + 2 35 /(2 34 - 1) 

+ 7/2 33 (2 34 - 1) - cr(2 33 , 2 34 - 1,1). 


It isn’t difficult to get this two-line effect, with \displaylines: 

$$\displaylines{\quad\sigma(2~{34}-l , 2 “{35} , 1) 

=-3+(2~{34}-l)/2~{35}+2"{35}\ ! / (2~{34}-l) \hf ill\cr 
\hf ill{}+7/2~{35} (2"{34}-l) -\sigma(2“{35} , 2"{34}-l , 1) ,\quad\cr}$$ 


An extra ‘O’ was typed on the second line here so that IjrX would know that the “+’ 
is a binary operation. The two-line form is especially recommended for equations that 
have a. long left-hand side; in that case the break generally conies just, before the = sign. 



► EXERCISE 19.18 

Typeset, the following display: 


E 

1<j <n 


(Xj 


1 

-Xj-l)(x -Xj)(Xj 


• 1 ) • • • J'.„ ) 

_ 1 

(x — Xl ) . . . (x — Xn ) ' 


( 27 ) 



► EXERCISE 19.19 

If it is necessary to typeset a. huge fraction like 


q2 n(n+1 \ea;q 2 )o 0 (eq/a;q 2 ) 00 (caq/e;q 2 ) 00 (cq 2 /ae;q 2 ) 00 


left 

right 

bigg 

two-line 

flush left 

flush right 

displaylines 

fraction, huge 


(e;g)oo(cg/e;g) OO 
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in a single narrow column, you might have to break up the numerator and resort to 
g i«.c:n+ i )( ea;g 2) oo ( eg / a ; 3 2) oo 

( caq/e ; q 2 )^(cq 2 /ae\ q 2 ) qq 

(e\q) oo(cg/e;g)oo 

How would you specify the latter fraction to l^rX? 


CHAUNDY 

BARRETT 

BATEY 

SWANSON 


When a formula is too long for the page-width 
and has to be broken into successive lines 
(and we are now, of course, speaking of displayed formulae), 
it should be broken, if possible, at the end of a natural 'phrase'; 

if, for example, it is a much-bracketed formula, 
it should be broken at the end of one of the major brackets 

and not at an inner symbol. 
This natural phrasing (as in music or speech) 
makes for intelligibility between writer and reader 
and should not be left to the compositor. 
An author, when he finds himself writing a iongish formula, 
should indicate a convenient point of fracture in case of need. 
— CHAUNDY, BARRETT, and BATEY, The Printing of Mathematics (1954) 

Some authors use display with discretion, 
some run even extremely long, complicated equations into the text, 
while others tend to display every equation in the paper. 
The tendency to overdisplay is probably more predominant 
than the tendency to underdisplay; 
for this reason it is possible for the copy editor to shorten 
(and even improve) papers by running displayed material into text. . . . 
On the other hand, there are occasions when the copy editor needs 
to suggest the display of complicated expressions that have been run into text, 
particularly when it would involve a bad break at the end of a text line. 

— ELLEN SWANSON, Mathematics into Type (1971) 



>age 198) 


r 


n 


20 

Definitions 
(also called Macros) 



Chapter 20: Definitions (also called Macros) 199 

You can often save time typing math formulas by letting control sequences stand 
for constructions that occur frequently in a particular manuscript. For example, 
if some document uses the vector ‘(x\, . . . ,x n )’ a lot, you can type 

\def \xvec{ (x_l , \ldots ,x_n) } 

and \xvec will henceforth be an abbreviation for 1 (x_l , \ldots ,x_n) Complex 
displays like 

^2 (/( ;c i > ■ ■ ■ > x ») + d(xi , • • • , £«)) 

i-' 1 1 i *• <0 

can then be typed simply as 

$$\sum_{\xvec\ne (0 , \ldots , 0) } \bigl (f \xvec+g\xvec\bigr ) $$ 

instead of in a tedious long form. By defining a control sequence like \xvec, you 
not only cut down on the number of keystrokes that you need to make, you also 
reduce your chances of introducing typographical errors and inconsistencies. 

Of course, you usually won’t be making a definition just to speed up 
the typing of one isolated formula; that doesn’t gain anything, because time 
goes by when you’re deciding whether or not to make a definition, and when 
you’re typing the definition itself. The real payoff comes when some cluster of 
symbols is used dozens of times throughout a manuscript. A wise typist will look 
through a document before typing anything, thereby getting a feeling for what 
sorts of problems will arise and what sorts of definitions will be helpful. For 
example, Chapter 16 recommends that the control sequence \Ahat be defined at 
the beginning of any manuscript that makes frequent use of the symbol A. 

Abbreviations like \xvec turn out to be useful in many applications of 
computers, and they have come to be known as macros because they are so 
powerful; one little macro can represent an enormous amount of material, so it 
has a sort of macroscopic effect. System programs like TgX that are designed 
to deal with macro definitions are said to expand the user’s macros; for ex- 
ample, \xvec expands into (x_l , \ldots ,x_n) , and \ldots in turn is a macro 
that expands into \mathinnerf\ldotp\ldotp\ldotp}. Thus, \xvec is actually 
an abbreviation for ‘ (x_l , \mathinnerf\ldotp\ldotp\ldotp} ,x_n) ’. (The ex- 
pansion stops here, because \mathinner is a primitive control sequence of TgX, 
and because \ldotp has been defined with \mathchardef ; thus \mathinner and 
\ldotp are not macros.) 

TgX users generally build up their own personal library of macros for 
things that they want to do in different documents. For example, it is common 
to have a file called macros.tex that contains definitions of your favorite spe- 
cial control sequences, perhaps together with commands that load your favorite 
special fonts, etc. If you begin a document with the command 

\ input macros 

then TgX will read all those definitions, saving you all the trouble of retyping 
them. Of course, TgX’s memory is limited, and it takes time to read a file, so 


control sequences 
defining a control sequence 
abbreviations, see macros 
macros 
ldots 

mathinner 
mathchardef 
library of macros 
input 
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you shouldn’t put thousands of definitions into macros.tex. A large collection 
of macro definitions (e.g., the set of definitions in Appendix B) is called a format, 
(e.g., “plain Tf.'X format”); TpX has a special way to input a format at high 
speed, assuming that the format doesn’t change very often. 

The \xvec and \Ahat examples apply to math formulas, but you can 
make good use of macro definitions even when you aren’t doing any math at all. 
For example, if you are using TgX for business correspondence, you can have a 
\yours macro that stands for ‘Sincerely yours, A. U. Thor’. If you often write 
form letters you can have macros that generate entire sentences or paragraphs 
or groups of paragraphs. The Internal Revenue Service could, for example, make 
use of the following two macros: 

\def \badcheck{A penalty has been added because your 
check to us was not honored by your bank.\par} 

\def \cheater{A penalty of 50Y/, of the underpaid tax 
has been added for fraud. \par} 

Simple macro definitions, like these, start with ‘\def’; then comes the control 
sequence name, e.g., ‘\badcheck’; and then comes the replacement text enclosed 
in and The braces do not represent grouping in this case; they simply 
show the extent of the replacement text in the definition. You could, of course, 
define a macro that includes actual braces in its replacement text, as long as 
those braces match each other properly. For example, ‘\def \xbold{{\bf x}}’ 
makes \xbold an abbreviation for ‘{\bf x}’. 


► EXERCISE 20.1 

Write a \punishment macro that prints 100 lines containing the message ‘I must 
not talk in class.’ [Hint: First write a macro \mustnt that prints the message 
once; then write a macro \f ive that prints it five times.] 



► EXERCISE 20.2 

What is the expansion of \puzzle, given the following definitions? 
\def\a{\b} 

\def\b{A\def\a{B\def\a{C\def\a{\b}}}} 

\def\puzzle{\a\a\a\a\a} 


f As soon as you get the hang of simple macros like those illustrated above, 
you will probably begin to think, “Boy, wouldn’t it be nice if I could have a 
macro in which some of the text in the expansion is changeable? I’d like to be able 
to stick different things into the middle of that text.” Well, TRX has good news for 
you: Control sequences can be defined in terms of parameters, and you can supply 
arguments that will be substituted for the parameters. 


f For example, let’s consider \xvec again. Suppose that you not only refer 
to ‘(xi, . . . ,x n )\ but you also make frequent use of ‘{yi , . . . , y n )' and other 
similar things. Then you might want to type 


format 

business correspondence 
Thor 

form letters 

Internal Revenue Service 

braces 

grouping 

parameters 

arguments 


\def\row#l{ (#1_1 , \ldots ,#l_n) } 
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after which \row x will produce ‘(xi , . . . , x n )’ and \row y will produce ‘(yi , . . . , (/„)’. 
The symbol #1 stands for the first parameter to the macro, and when you say ‘\row x’ 
the x is a so-called argument that will be inserted in place of the Si’s in the replace- 
ment text. In this case the argument consists of a single letter, x. You can also say 
\row\alpha, in which case the argument will be the control sequence \alpha, and the 
result will be ‘(cvi, ...,«„)’. If you want the argument to contain more than one symbol 
or control sequence, you can simply enclose it in braces; for example, \row{x’} yields 
(x'i, . . . ,x' n ). The argument in this case is x’ (without the braces). Incidentally, if you 
say \row{{x’}}, you get {x'i , . . . , a?' lTS ); the reason is that only one pair of braces is 
stripped off when the argument is collected, and (x'i , . . . , x'„ ) is what you get from 
({x’}_l,\ldots,{x’}_n) in math mode, according to the rules of Chapter 16. 



► EXERCISE 20.3 

Continuing this example, what is the result of $\row{\bf x}$? 



The notation ‘#T suggests that there might be an opportunity to have more 
than one parameter, and indeed there is. You can write, for example, 


\def \row#l#2{ (#1_1 , \ldots , #1_#2) } 


after which ‘\row xn’ would be the proper protocol for ‘(aq, . . . ,x„)’. There can be as 
many as nine parameters, #1 to #9, and when you use them you must number them in 
order. For example, you can’t use #5 in a definition unless the previous parameter 
in that definition was called #4. (This restriction applies only to the initial statement 
of parameters, before the replacement text starts; the stated parameters can be used 
any number of times, in any order, in the replacement text itself.) 

f A control sequence has only one definition at a time, so the second definition 
of \row would supersede the first one if both had appeared in the same doc- 
ument. Whenever TgX encounters a macro that it wants to expand, it uses the most 
recent definition. However, definitions are local to the group that contains them; old 
definitions will be restored in the usual way when a group ends. 

f Caution: When you define a macro with simple parameters, as in these exam- 
ples, you must be careful not to put blank spaces before the that begins the 
replacement text. For example, ‘\def\row #1 #2 {...}’ will not give the same result 
as ‘\def\row#l#2{ . . .}’, because the spaces after #1 and #2 tell TgX to look for argu- 
ments that are followed by spaces. (Arguments can be “delimited” in a fairly general 
way, as explained below.) But the space after \row is optional, as usual, because TpX 
always disregards spaces after control words. After you have said ‘\def \row#l#2{ ...}’, 
you are allowed to put spaces between the arguments (e.g., ‘\row x n’), because TfrX 
doesn’t use single spaces as undelimited arguments. 

f The following exercise is particularly recommended for people who want to 
learn to write l^X macros. Even if you have gotten into the dangerous habit 
of skimming other exercises, you should try your hand at this one. 

►EXERCISE 20.4 

JL Extending exercise 20.1, write a “generalized punishment” macro that has two 
parameters, so that \punishment{run}{the halls} will produce 100 paragraphs that 
say ‘I must not run in the halls.’ 


sharpsign 

alpha 

braces 

apostrophe 

local 

group 
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/gK/gK T|-'X also allows you to define macros whose parameters are delimited in quite 
JL JL a general way; you needn’t always enclose arguments in braces. For example, 

\def\cs #1. #2\par{...} 

defines a control sequence \cs with two parameters, and its two arguments will be 
determined as follows: #1 will consist of all tokens between \cs and the next subsequent 
appearance of ‘. u ’ (period and space); #2 will consist of all tokens between that ‘. u ’ 
and the next \par token. (The \par might be given explicitly, or it might be generated 
by a blank line as explained in Chapter 8.) For example, when TjrX expands 

\cs You owe \$5.00. Pay it.\par 


the first argument is ‘You owe \$5.00’ and the second is ‘Pay it.’. The period in 
‘\$5.00’ doesn’t stop #1, in this example, because TjrjX keeps going until finding a 
period that is followed immediately by a space. 



Furthermore, an argument will not stop when its delimiter is enclosed in 
braces, because that would produce unbalanced braces. For example, in 


\def \cs #1 . #2\par{ . . . } 

the first argument is now delimited by a single period, so #1 would be ‘You owe \$5’ 
and the #2 would be ‘00. Pay it.’ if \cs were invoked as above. But 


\cs You owe {\$5.00}. Pay it.\par 


satisfactorily hides the first period, making it part of argument #1, which becomes 
‘You owe {\$5.00}’. 


/gL/gL If you are designing a format for mathematical papers, you will probably 
JL JL want to include a macro for the statement of theorems, definitions, lemmas, 
corollaries, and such things. For example, you might want to typeset a statement like 


Theorem 1. Tpfi has a powerful macro capability. 


from the input 


\proclaim Theorem 1. \TeX\ has a powerful macro capability . \par 

In fact, plain TjgX includes a \proclaim macro that does just that; its definition is 

\def\proclaim #1. #2\par{\medbreak 

\noindent{\bf #1 . \enspace}{\sl#2}\par\medbreak} 

so the arguments are delimited exactly as in our first \cs example. The replacement 
text here uses \medbreak to separate the proclaimed paragraph from what precedes 
and follows; the title of the proclamation is set in bold face type, while the text itself is 
set slanted. (The actual definition of \proclaim in Appendix B is not quite the same 
as this; the final \medbreak has been modified so that a break between pages will be 
discouraged immediately following the statement of a theorem. Hence a short theorem 
will tend to appear at the top of a page rather than at the bottom. ) 

/gL/gL By making changes to the \proclaim macro, you can change the format of 
JL JL all the proclamations in your paper, without changing the text of the paper 
itself. For example, you could produce something like 

THEOREM 1: TpfX has a powerful macro capability. 


par 

theorems 

proclaim 

enunciations, see proclaim 
enspace 
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by making simple alterations to the replacement text of \proclaim, assuming that you 
have a “caps and small caps” font. TgX is intended to support higher-level languages 
for composition in which all of the control sequences that a user actually types are 
macros rather than T^X primitives. The ideal is to be able to describe important 
classes of documents in terms of their components, without mentioning actual fonts or 
point sizes or details of spacing; a single style-independent document can then be set 
in many different styles. 

/gK/gK Now that we have seen a number of examples, let’s look at the precise rules 
JL JL that govern TfjX macros. Definitions have the general form 

\def (control sequence) (parameter text) {(replacement text)} 


where the (parameter text) contains no braces, and where all occurrences of { and } 
in the (replacement text) are properly nested. Furthermore the # symbol has a special 
significance: In the (parameter text), the first, appearance of # must be followed by 1, 
the next by 2, and so on; up to nine #’s are allowed. In the (replacement text) each # 
must be followed by a digit that appeared after # in the (parameter text), or else the # 
should be followed by another #. The latter case stands for a single # token when the 
macro is expanded; the former case stands for insertion of the corresponding argument. 

/gL/gL For example, let’s consider a “random” definition that doesn’t do anything 
JL JL useful except that it does exhibit TfrX’s rules. The definition 


\def\cs AB#1#2C$#3\$ {#3{ab#l}#l c##\x #2} 


says that the control sequence \cs is to have a parameter text consisting of nine tokens 
An, Bn, #1, #2, Cn, $3, #3, [$], uio 

(assuming the category codes of plain TJpX), and a replacement text of twelve tokens 

#3, -Ci , an, bn, #1, } 2 , #1, uio, cn, #6, [x], #2. 

Henceforth when T)?X reads the control sequence \cs it will expect that the next two 
tokens will be An and Bn (otherwise you will get the error message ‘Use of \cs doesn’t 
match its definition’); then conies argument #1, followed by argument #2, then Cn, 
then $3, then argument #3, then \$, and finally a space token. It is customary to use the 
word “argument” to mean the string of tokens that gets substituted for a parameter; 
parameters appear in a definition, and arguments appear when that definition is used. 
(For the purposes of these rules, we are extending Chapter 7’s definition of token: 
In addition to control sequences and (character code, category code) pairs, T)jX also 
recognizes “parameter tokens,” denoted here by #1 to #9. Parameter tokens can appear 
only in token lists for macros.) 

/gL/gL How does T)gX determine where an argument stops, you ask. Answer: There 
JL JL are two cases. A delimited parameter is followed in the (parameter text) 
by one or more non-parameter tokens, before reaching the end of the parameter text 
or the next parameter token; in this case the corresponding argument is the shortest 
(possibly empty) sequence of tokens with properly nested { . . . } groups that is followed 
in the input by this particular list of non-parameter tokens. (Category codes and 
character codes must both match, and control sequence names must be the same. ) An 
undelimited parameter is followed immediately in the (parameter text) by a parameter 
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token, or it occurs at the very end of the parameter text; in this case the corresponding 
argument is the next nonblank token, unless that token is when the argument will 
be the entire { . . . } group that follows. In both cases, if the argument found in this 
way has the form ‘{{nested tokens}}’, where (nested tokens) stands for any sequence of 
tokens that is properly nested with respect to braces, the outermost braces enclosing 
the argument are removed and the (nested tokens) will remain. For example, let’s 
continue with \cs as defined above and suppose that the subsequent text contains 

\cs AB {\Look}C${And\$ }{look}\$ 5. 

Argument #1 will be the token |Look| , since #1 is an undelimited parameter (it is 
followed immediately by #2 in the definition); in this case T)gX ignores the blank space 
after B, and strips the braces off of {\Look}. Argument #2 will be empty, since C$ 
follows immediately. And argument #3 will be the thirteen tokens corresponding to 
the text {And\$ u Hlook}, because #3 is to be followed by ‘\$ u \ and because the first 
occurrence of ‘\$ u ’ is within braces. Even though argument #3 begins with a left brace 
and ends with a right brace, the braces are not removed, since that would leave the 
unnested tokens ‘And\$ Hlook’. The net effect then, after substituting arguments for 
parameters in the replacement text, will be that Tj?X will next read the token list 

{And\$ }{look}{ab\Look}\Look u c#\x5 . 


The space u here will be part of the resulting token list, even though it follows the 
control word \Look, because spaces are removed after control word tokens only when 
T)gX first, converts input lines to token lists as described in Chapter 8. 


(>)<>)*■ EXERCISE 20.5 

JL JL The example definition of \cs includes a ## in its replacement text, but the 
way ## is actually used in that example is rather pointless. Give an example of a 
definition where ## serves a useful purpose. 

A special extension is allowed to these rules: If the very last character of 
JL JL the (parameter text) is #, so that this # is immediately followed by {, TpX 
will behave as if the { had been inserted at the right end of both the parameter text 
and the replacement text. For example, if you say ‘\def\a#l#{\hbox to #1}’, the 
subsequent text ‘\a3pt{x}’ will expand to ‘\hbox to 3pt{x}’, because the argument 
of \a is delimited by a left brace. 

Tokens that precede the first parameter token in the (parameter text) of a 
JL JL definition are required to follow the control sequence; in effect, they become 
part of the control sequence name. For example, the author might have said 


\def\TeX/{. . .} 


instead of defining \TeX without the slash. Then it would be necessary to type \TeX/ 
each time the T})X logo is desired, but the new definition would have the advantage that 
spaces are not ignored after ‘\TeX/’. You can use this idea to define macros that are 
intended to be used in sentences, so that users don’t have to worry about the possible 
disappearance of spaces. 


(>)<>)*■ EXERCISE 20.6 

JL JL Define a control sequence \a such that \a{ . . . } expands to \b{ . . . }, and such 
that T)r}X gives an error message if \a is not immediately followed by a left brace. 
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Complicated macros have a habit of behaving differently from what you ex- 
JL JL pect, when you first define them, even though TgX’s rules are not especially 
complicated. If you have trouble understanding why some \def doesn’t work the way 
you think it should, help is available: You can set \tracingmacros=l, whereupon TpX 
will write something in your log file whenever it expands a macro, and whenever it 
has read a macro argument. For example, if \tracingmacros is positive when Tf^X 
processes the \cs example above, it will put the following four lines into the log: 

\cs AB#1#2C$#3\$ ->#3{ab#l}#l c##\x #2 
#l<-\Look 


#3<-{And\$ Hlook} 

&)($) In all of the rules stated above, and and “#’ stand for any characters 
JL JL whose category codes are respectively 1, 2, and 6 in the token list when TjgX 
reads the macro definition; there’s nothing sacred about the particular symbols that 
plain TjjX uses to denote grouping and parameters. You can even make use of several 
different characters with these category codes, all at the same time. 

(£)(£)*■ EXERCISE 20.7 

JL JL Suppose that ‘ [’, “]’, and “!’ have the respective catcodes 1, 2, and 6, as do 
and “#’. See if you can guess what the following definition means: 

\def\ ! ! 1#2 ! [{!#]#! ! 2} 

What token list will result when ‘\ ! x{[y]] [z}’ is expanded? 

/>)/>) In practice, we all make mistakes. And one of the most, common typographic 
JL JL errors is to forget a or to insert an extra ‘{’, somewhere in an argument 
to a macro. If TjrjX were to follow the rules blindly in such a case, it would have to 
keep absorbing more and more tokens in hopes of finding the end of the argument. But 
a mistyped argument is unending, like so many arguments in real life (sigh); so TjrX 
would have to go on until the end of the file, or (more likely) until tokens completely fill 
the computer’s memory. In either case, a single typographical error would have ruined 
the run, and the user would be forced to start over. Therefore TgX has another rule, 
intended to confine such errors to the paragraph in which they occur: The token Apar’ 
is not allowed to occur as part of an argument, unless you explicitly tell TpX that \par 
is OK. Whenever Tj?X is about to include \par as part of an argument, it will abort 
the current macro expansion and report that a “runaway argument” has been found. 

/>)/>) If you actually want a control sequence to allow arguments with \par tokens, 
JL JL you can define it to be a “long” macro by saying Along’ just before Adef’. 
For example, the \bold macro defined by 

\long\def \bold#l{{\bf #1}} 

is capable of setting several paragraphs in boldface type. (However, such a macro is 
not an especially good way to typeset bold text. It would be better to say, e.g., 

\def\beginbold{\begingroup\bf } 

\def\endbold{\endgroup} 

because this doesn’t fill TjrjX’s memory with a long argument.) 
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/>>/>) The \par-forbidding mechanism doesn’t catch all conceivable missing-brace 
JL JL errors, however; you might forget the } at the end of a \def, and the same 
problem would arise. In this case it’s harder to confine the error, because \par is a useful 
thing in replacement texts; we wouldn’t want to forbid \par there, so TgX has another 
mechanism: When a macro definition is preceded by ‘\outer’, the corresponding control 
sequence will not be allowed to appear in any place where tokens are being absorbed 
at high speed. An \outer macro cannot appear in an argument (not even when \par is 
allowed), nor can it appear in the parameter text or the replacement text of a definition, 
nor in the preamble to an alignment, nor in conditional text that is being skipped over. 
If an \outer macro does show up in such places, stops what it is doing and reports 
either a “runaway” situation or an “incomplete” conditional. The end of an input file 
is also considered to be \outer in this sense; for example, a file shouldn’t end in the 
middle of a definition. If you are designing a format for others to use, you can help them 
detect errors before too much harm is done, by using \outer with all control sequences 
that should appear only at “quiet times” within a document. For example, Appendix B 
defines \proclaim to be \outer, since a user shouldn’t be stating a theorem as part of 
a definition or argument or preamble. 


/>)/>) We have now seen that \def can be preceded by \long or \outer, and it 
JL JL can also be preceded by \global if the definition is supposed to transcend 
its group. These three prefixes can be applied to \def in any order, and they can 
even appear more than once. 1)?X also has a \gdef primitive that is equivalent to 
\global\def. Thus, for example, 


\long\outer\global\long\def 


means the same thing as ‘\outer\long\gdef ’. 



So far in this manual we have encountered several ways to assign a meaning 
to a control sequence. For example, 


\font\cs={external font name) makes \cs a font identifier; 
\chardef\cs=(number) makes \cs a character code; 

\countdef \cs=(number) makes \cs a \count register; 

\def\cs. makes \cs a macro. 


It’s time now to reveal another important command of this type: 

\let\cs=(token) gives \cs the token’s current meaning. 

If the (token) is another control sequence, \cs will acquire the same significance as 
that control sequence. For example, if you say ‘\let\a=\def ’, you could then say 
‘\a\b. to define a macro \b, because \a would behave like T^X’s primitive 

\def command. If you say 


\let\a=\b \let\b=\c \let\c=\a 


you have interchanged the former meanings of \b and \c. And if you say 

\outer\def \a#l . {#1 : } 

\let\b=\a 
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the effect is exactly the same as ‘\outer\def\b#l.{#l:} \let\a=\b’. 
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If the (token) in a \let is a single character — i.e., if it is a (character code, 
JL JL category code) pair — then the control sequence will behave to a certain extent 
like that character; but there are some differences. For example, after ‘\let\zero=0’ 
you can’t use \zero in a numerical constant, because TgX requires the tokens in a 
numerical constant to be digits, after macro expansion; \zero is not a macro, so it 
doesn’t expand. However, such uses of \let have their value, as we will see later. 

EXERCISE 20.8 

JL JL Is there a significant difference between ‘\let\a=\b’ and ‘\def \a{\b}’ ? 
(£)(£)*■ EXERCISE 20.9 

JL JL Experiment with T^X to discover the answers to the following questions: (a) If 
the control sequence \par has been redefined (e.g., ‘\def\par{\endgroup\par}’ ), is 
\par still forbidden to appear in an argument? (b) If you say \let\xpar=\par, is 
\xpar also forbidden in an argument? 

TeX also allows the construction l \futureletMs(tokeni)(token 2 )’, which has 
JL JL the effect of ‘\let\cs = (token 2 )(tokeni)(token 2 )’. The idea is that you can 
say, for example, ‘\futurelet\a\b’ at the end of the replacement text of a macro; 
T)rX will set \a to the token that follows the macro, after which \b will be expanded. 
The control sequence \b can continue the processing, and it can examine \a to see 
what’s coming up next. 

f Tlre next thing a person wants, after getting used to macros with parameters, 
is the ability to write macros that change their behavior depending on current 
conditions. T]-X provides a variety of primitive commands for this purpose. The general 
form of such “conditional text” is 

\if (condition) (true text)\else(false text)\fi 

where the (true text) is skipped unless the (condition) is true, and the (false text) 
is skipped unless the (condition) is false. If the (false text) is empty, you can omit 
the \else. The ‘\if (condition)’ part of this construction begins with a control sequence 
whose first two letters are ‘if’; for example, 

\if odd\count0 \rightpage \else\leftpage \fi 

specifies a condition that is true when TgX’s integer register \count0 is odd. Since TpX 
generally keeps the current page number in \countO, the macro \rightpage will be 
expanded in this example if the page number is odd, while Meftpage will be expanded 
if the page number is even. Conditional commands always end with a final ‘\fi’. 

f Conditionals are primarily intended for experienced users, who want to 
define high-level macros; therefore the remaining paragraphs in this chapter 
are headed by “double dangerous bends.” Do not feel guilty about skipping right to 
Chapter 21; in other words, imagine that the manual says ‘\if experienced’ right here, 
and that there is a matching ‘\fi’ at the end of the present chapter. 

&)(&) Before we discuss T^jX’s repertoire of \if . . . commands, let’s look at another 
JL JL example, so that the general ideas will be clear. Suppose that the Mount 
register \balance holds an amount that somebody has paid in excess of his or her 
income tax; this amount is given in pennies, and it might be positive, negative, or zero. 
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Our immediate goal will be to write a TjrjX macro that generates a suitable statement 
for the Internal Revenue Service to include as part of a letter to that person, based on 
the amount of the balance. The statement will be quite different for positive balances 
than for negative ones, so we can exploit TjjX’s ability to act conditionally: 

\def\statement{\ifnum\balance=0 \fullypaid 
\else\ifnum\balance>0 \overpaid 
\else\underpaid 
\fi 

\fi} 

Here \ifnum is a conditional command that compares two numbers; the \statement 
macro reduces to \fullypaid if the balance is zero, and so on. 

/gb/gb It is vastly important to notice the spaces after the 0’s in this construction. 
X X If the example had said 

. . . =0\fullypaid. . . 

then IjgX would have begun to expand ‘\fullypaid’ before it knew the value of the 
constant 0, because \fullypaid might start with a 1 or something that would change 
the number. (After all, ‘0T is a perfectly acceptable (number), in TjjX’s eyes.) In this 
particular case the program would still have worked, because we will see in a moment 
that \fullypaid begins with the letter Y; thus, the only problem caused by the missing 
space would be that TjjX would go slower, since it would have to skip over the whole 
expansion of \fullypaid instead of just skipping \f ullypaid as a single, unexpanded 
token. But in other situations a missing space like this might cause TjrX to expand 
macros when you don’t want any expansion, and such anomalies can cause subtle and 
confusing errors. For best results, always put a blank space after a numeric constant; 
this blank space tells T|’X that the constant is complete, and such a space will never “get 
through” to the output. In fact, when you don’t have a blank space after a constant, 
TjgX actually has to do more work, because each constant continues until a non-digit 
has been read; if this non-digit is not a space, IjgX takes the token you did have and 
backs it up, ready to be read again. (On the other hand, the author often omits the 
space when a constant is immediately followed by some other character, because extra 
spaces do look funny in the file; aesthetics are more important than efficiency.) 

EXERCISE 20.10 

JL JL Continuing the IRS example, assume that \fullypaid and \underpaid are 
defined as follows: 

\def\fullypaid{Your taxes are fully paid thank you.} 

\def\underpaid{{\countO=-\balance 

\ifnum\count0<100 

You owe \dollaramount , but you need not pay it, because 
our policy is to disregard amounts less than \$1.00. 

\else Please remit \dollaramount\ within ten days, 
or additional interest charges will be due.\fi}} 

Write a macro \overpaid to go with these, assuming that \dollaramount is a macro 
that generates the contents of \countO in dollars and cents. Your macro should say 
that a check will be mailed under separate cover, unless the amount is less than $1.00, 
in which case the person must specifically request a check. 
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/gK/gK^ EXERCISE 20.11 

JL JL Write a \dollaramount macro, to complete the Internal Revenue \statement. 

/gK/gK Now let’s make a complete survey of TjjX’s conditional commands. Some of 
JL JL them involve features that have not yet been introduced in this manual. 

■ \ifnum{numberi){relation}(number 2 ) (compare two integers) 

The (relation) must be either ‘<i 2 ’ or ‘= 12 ’ or ‘> 12 ’. The two integer numbers are 
compared to each other in the usual way, and the result is true or false accordingly. 

■ \ifdim(dimeni)(relation)(dimen 2 ) (compare two dimensions) 

This is like \ifnum, but it compares two (dimen) values. For example, to test whether 
the value of \hsize exceeds 100 pt, you can say ‘\ifdim\hsize>100pt’. 

■ \ifodd(number) (test for odd integer) 

The condition is true if the (number) is odd, false if it is even. 

■ \ifvmode (test for vertical mode) 

True if TjgX is in vertical mode or internal vertical mode (see Chapter 13). 

■ \ifhmode (test for horizontal mode) 

True if TjrjX is in horizontal mode or restricted horizontal mode (see Chapter 13). 

■ \ifmmode (test for math mode) 

True if T)?X is in math mode or display math mode (see Chapter 13). 

■ \ifinner (test for an internal mode) 

True if TjrjX is in internal vertical mode, or restricted horizontal mode, or (nondisplay) 
math mode (see Chapter 13). 

■ \ if (token i)(token 2 ) (test if character codes agree) 

IjjX will expand macros following \if until two unexpandable tokens are found. If 
either token is a control sequence, TpX considers it to have character code 256 and 
category code 16, unless the current equivalent of that control sequence has been \let 
equal to a non-active character token. In this way, each token specifies a (charac- 
ter code, category code) pair. The condition is true if the character codes are equal, 
independent of the category codes. For example, after \def\a{*} and Met\b=* and 
\def\c{/}, the tests ‘Mf*\a’ and ‘\if\a\b’ will be true, but ‘\if\a\c’ will be false. 
Also ‘\if\a\par’ will be false, but ‘\if\par\let’ will be true. 

■ \if cat(tokeni)(token 2 ) (test if category codes agree) 

This is just like \if, but it tests the category codes, not the character codes. Active 
characters have category 13, but you have to say ‘\noexpand(active character)’ in order 
to suppress expansion when you are looking at such characters with \if or \ifcat. For 
example, after 

\catcode ‘ [=13 \catcode ‘] =13 \def[{*} 

the tests ‘\ifcat\noexpand[\noexpand] ’ and ‘Mfcat[*’ will be true, but the test 
‘\if cat\noexpand [*’ will be false. 
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■ \ifx{tokeni){token 2 ) (test if tokens agree) 

In this case, T|-'X does not expand control sequences when it looks at the two tokens. 
The condition is true if (a) the two tokens are not macros, and they both represent the 
same (character code, category code) pair or the same T|-'X primitive or the same \font 
or \chardef or \countdef, etc..; or if (b) the two tokens are macros, and they both 
have the same status with respect to \long and \outer, and they both have the same 
parameters and “top level” expansion. For example, after ‘\def\a{\c} \def\b{\d} 
\def\c{\e} \def\d{\e} \def\e{A}’, an \ifx test will find \c and \d equal, but not 
\a and \b, nor \d and \e, nor any other combinations of \a, \b, \c, \d, \e. 

■ \ifvoid(number), \ifhbox(number), \ifvbox(number) (test a box register) 
The (number) should be between 0 and 255. The condition is true if that \box is void 
or contains an hbox or a vbox, respectively (see Chapter 15). 

■ \ifeof (number) (test for end of file) 

The (number) should be between 0 and 15. The condition is true unless the corre- 
sponding input stream is open and not fully read. (See the command \openin below.) 

■ \iftrue, \iffalse (always true or always false) 

These conditions have a predetermined outcome. But they turn out to be useful in 
spite of this, as explained below. 

Finally, there’s one more conditional construction, which is somewhat different 
from the rest because it is capable of making a many-way branch: 

■ \ if case (number) (text for case 0)\or(text for case l)\or • • • 

\or(text for case n)\else(text for all other cases) \fi 


Here there are n + 1 cases separated by n \or’s, where n can be any nonnegative 
number. The (number) selects the text that will be used. Once again the \else part is 
optional, if you don’t want to specify any text for cases when the (number) is negative 
or greater than n. 

(£)(£)>■ EXERCISE 20.12 

JL JL Design a \category macro that prints a character’s current category code 
symbolically, given a one-character control sequence for that character. For example, if 
the category codes of plain TgX are in force, ‘\category\V should expand to ‘escape’, 
and ‘\category\a’ should expand to ‘letter’. 

<§)(£)*■ EXERCISE 20.13 

JL JL Test yourself on the following questions to see if you understand certain bor- 
derline situations: After the definitions ‘\def\a{> \def\b{**} \def\c{True}’, which 
of the following are true? (a) ‘\if\a\b’; (b) ‘\if cat\a\b’; (c) ‘\ifx\a\b’; (d) ‘\if\c’; 
(e) ‘\ifcat\c’; (f) ‘\ifx\ifx\ifx’. (g) ‘\if \ifx\a\b\c\else\if \a\b\c\f i\f i’. 

Notice that all of the control sequences for conditionals begin with \if . . ., 
JL JL and they all have a matching \fi. This convention — that \if . . . pairs up 
with \fi — makes it easier to see the nesting of conditionals within your program. The 
nesting of \if . . . \f i is independent of the nesting of { . . . }; thus, you can begin or end 
a group in the middle of a conditional, and you can begin or end a conditional in the 
middle of a group. Extensive experience with macros has shown that such independence 
is important in applications; but it can also lead to confusion if you aren’t careful. 
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/>)/>) sometimes desirable to pass information from one macro to another, and 
JL JL there are several ways to do this: by passing it as an argument, by putting 
it into a register, or by defining a control sequence that contains the information. 
For example, the macros \hphantom, \vphantom, and \phantom in Appendix B are 
quite similar, so the author wanted to do most of the work in another macro \phant 
that would be common to all three. Somehow \phant was to be told what kind of 
phantom was desired. The first approach was to define control sequences \hph and 
\vph something like this: 

\def\hphantom{\ph YN} \def \vphantom{\ph NY} \def\phantom{\ph YY} 
\def\ph#l#2{\def\hph{#l}\def\vph{#2}\phant} 

after which \phant could test ‘\if Y\hph ! and ‘\if Y\vph’. This worked, but there were 
various ways to make it more efficient; for example, l \def\hph{#l}’ could be replaced 
by ‘\let\hph=#T, avoiding macro expansion. An even better idea then suggested itself: 

\def\yes{\ifOO} \def\no{\if 01} 

\def \hphantom{\ph\yes\no} . . . \def \phantom{\ph\yes\yes} 
\def\ph#l#2{\let\ifhph=#l\let\ifvph=#2\phant} 

after which \phant could test Aifhph’ and Aifvph’. (This construction was tried 
before \iftrue and \iffalse were part of the l^X language.) The idea worked fine, 
so the author started to use \yes and \no in a variety of other situations. But then 
one day a complex conditional failed, because it contained an \ifhph-like test inside 
another conditional: 

\ifhph...\fi ... \else ... \fi 

Do you see the problem that developed? When the (true text) of the outermost con- 
ditional was executed, everything worked fine, because \ifhph was either \yes or \no 
and it expanded into either \if00 or \if01. But when the (true text) was skipped, the 
\ifhph was not expanded, so the first \fi was mistakenly paired with the first \if; 
everything soon went haywire. That’s when \iftrue and \iffalse were put into the 
language, in place of \yes and \no; now \ifhph is either \iftrue or \iffalse, so T^X 
will match it properly with a closing \fi, whether or not it is being skipped over. 

/gb/gb To facilitate \if . . . constructions, plain Tp,X lias a \newif macro, such that 
JL JL after you say ‘\newif \ifabc’ three control sequences will be defined: \ifabc 
(for testing the switch), \abctrue (for making the switch true), and \abcfalse (for 
making it false). The \phantom problem is now solved in Appendix B by writing 

\newif\ifhph \newif\ifvph 
\def\hphantom{\hphtrue\vphf alse\phant} 

and with similar definitions of \vphantom and \phantom. There is no longer any need 
for a \ph macro; again \phant tests \ifhph and \ifvph. Appendix E contains other 
examples of conditionals created by \newif. New conditionals are initially false. 

/>)/>) Caution: Don’t say anything like ‘\let\if abc=\iftrue’ in conditional text. 
JL JL If TeX skips over this command, it will think that both \ifabc and \iftrue 
require a matching \fi, since the \let is not being executed! Keep such commands 
buried inside macros, so that T^X will see the ‘\if . . .’ only when it is not skipping 
over the text that it is reading. 
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/gK/gK TJlX has 256 “token list registers” called \toksO through \toks255, so that 
JL JL token lists can easily be shuffled around without passing them through Tj?X’s 
reading apparatus. There’s also a \toksdef instruction so that, e.g., 

\toksdef\catch=22 


makes \catch equivalent to \toks22. Plain T)?X provides a \newtoks macro that 
allocates a new token list register; it is analogous to \newcount. Token list registers 
behave like the token list parameters \everypar, \everyhbox, \output, \errhelp, etc. 
To assign a new value to a token list parameter or register, you say either 

(token variable) ={ (replacement text)} 
or (token variable)=(token variable) 


where (token variable) means either a token list parameter or a control sequence defined 
by \toksdef or \newtoks, or an explicit register designation ‘\toks(number)\ 

/gL/gL Everyone who makes extensive use of a powerful macro facility encounters 
JL JL situations when the macros do surprising things. We have already mentioned 
the possibility of setting \tracingmacros=l, in order to see when TjgX expands macros 
and what arguments it finds. There’s also another helpful way to watch what Tp:X is 
doing: If you set \tracingcoiranands=l, IjvX will show every command that it executes, 
as we saw in Chapter 13. Furthermore, if you set \tracingcommands=2, T} 5 X will show 
all conditional commands and their outcomes, as well as the unconditional commands 
that are actually performed or expanded. This diagnostic information goes into your log 
file. You can also see it on your terminal, if you say \tracingonline=l. (Incidentally, 
if you make \tracingcoiranands greater than 2, you get the same information as when 
it equals 2.) Similarly, \tracingmacros=2 will trace \output, \everypar, etc. 


/gL/gL One way to understand the occasional strangeness of macro operation is to 
JL JL use the tracing features just described, so that you can watch what TjgX does 
in slow motion. Another way is to learn the rules for how macros are expanded; we 
shall now discuss those rules. 


/gL/gL TjjX’s mastication process converts your input to a long token list, as explained 
JL JL in Chapter 8; and its digestive processes work strictly on this token list. When 
TjgX encounters a control sequence in the token list, it looks up the current meaning, 
and in certain cases it will expand that token into a sequence of other tokens before 
continuing to read. The expansion process applies to macros and to certain other 
special primitives like \number and \if that we shall consider momentarily. Sometimes, 
however, the expansion is not carried out; for example, when TjgX is taking care of a 
\def, the (control sequence), the (parameter text), and the (replacement text) of that 
\def are not subject to expansion. Similarly, the two tokens after \ifx are never 
expanded. A complete list of occasions when tokens are not expanded appears later in 
this chapter; you can use it for reference in an emergency. 



Now let’s consider the control sequences that are expanded whenever expan- 
sion lias not been inhibited. Such control sequences fall into several classes: 


■ Macros. When a macro is expanded, TjjX first determines its arguments (if 
any), as explained earlier in this chapter. Each argument is a token list; the tokens 
are not expanded when they are being accepted as arguments. Then IjjX replaces the 
macro and its arguments by the replacement text. 
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■ Conditionals. When an \if . . . is expanded, TjrX reads ahead as far as nec- 
essary to determine whether the condition is true or false; and if false, it skips ahead 
(keeping track of \if . . .\fi nesting) until finding the \else, \or, or \fi that ends 
the skipped text. Similarly, when \else, \or, or \fi is expanded, IjrX reads to the 
end of any text that ought to be skipped. The “expansion” of a conditional is empty. 
(Conditionals always reduce the number of tokens that are seen by later stages of the 
digestive process, while macros usually increase the number of tokens.) 

■ \number (number). When TjrjX expands Xnumber, it reads the (number) that 
follows (expanding tokens as it goes); the final expansion consists of the decimal rep- 
resentation of that number, preceded by if negative. 

■ \romannumeral (number). This is like \number, but the expansion consists of 
lowercase roman numerals. For example, ‘Xromannumeral 1984’ produces ‘mcmlxxxiv’. 
The expansion is empty if the number is zero or negative. 

■ Xstring(token). TgX first reads the (token) without expansion. If a control 
sequence token appears, its \ string expansion consists of the control sequence name 
(including \escapechar as an escape character, if the control sequence isn’t simply an 
active character). Otherwise the (token) is a character token, and its character code is 
retained as the expanded result. 

■ \jobname. The expansion is the name that l^jX has chosen for this job. For 
example, if TgX is putting its output on files paper. dvi and paper.log, then \jobname 
expands to ‘paper’. 

■ \fontname(font). The expansion is the external file name corresponding to the 
given font; e.g., ‘\f ontnameXtenrm’ might expand to ‘cmrlO’ (five tokens). If the font is 
not being used at its design size, the “at size” also appears in the expansion. A (font) 
is either an identifier defined by Xfont; or Xtextfont (number), Xscriptfont(number), 
or Xscriptscriptfont(number); or \font, which denotes the current font. 

■ Xmeaning(token). T^X expands this to the sequence of characters that would 
be displayed on your terminal by the commands ‘\let\test=(token) \show\test’. 
For example, ‘Xmeaning A’ usually expands to ‘the letter A’; ‘\meaning\A’ after 
‘\def\A#lB{\C}’ expands to ‘macro : #1B->\C ’. 

■ \csname . . . Xendcsname. When Tf^X expands \csname it reads to the matching 
Xendcsname, expanding tokens as it goes; only character tokens should remain after this 
expansion has taken place. Then the “expansion” of the entire Xcsname. . .Xendcsname 
text will be a single control sequence token, defined to be like Xrelax if its meaning is 
currently undefined. 

■ Xexpandaf ter (token). TjtX first reads the token that conies immediately after 
Xexpandafter, without expanding it; let’s call this token t. Then T^X reads the token 
that comes after t (and possibly more tokens, if that token has an argument), replacing 
it by its expansion. Finally TfrX puts t back in front of that expansion. 

■ Xnoexpand(token). The expansion is the token itself; but that token is inter- 
preted as if its meaning were ‘Xrelax’ if it is a control sequence that would ordinarily 
be expanded by TjjX’s expansion rules. 

■ Xtopmark, Xfirstmark, Xbotmark, \splitf irstmark, and Xsplitbotmark. The 
expansion is the token list in the corresponding “mark” register (see Chapter 23). 
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m \ input {file name). The expansion is null; but T)?X prepares to read from the 
specified file before looking at any more tokens from its current source. 

■ \endinput. The expansion is null. The next time TfrX gets to the end of an 
\input line, it will stop reading from the file containing that line. 

■ \the (internal quantity). The expansion is a list of tokens representing the 
current value of one of T)?X’s variables, as explained below. For example, ‘\the\skip5’ 
might expand into ‘5. Opt plus 2.0fil’ (17 tokens). 

/gb/gb The powerful \the operation has many subcases, so we shall discuss them one 
JL JL at a time. A variety of internal numeric quantities can be brought up front: 

■ \the (parameter), where (parameter) is the name of one of TgX’s integer pa- 
rameters (e.g., \the\widowpenalty), dimension parameters (e.g., \the\parindent), 
glue parameters (e.g., \the\leftskip), or muglue parameters (e.g., \the\thinmuskip). 

■ \the (register), where (register) is the name of one of TJrX’s integer registers 
(e.g., \the\count 0), dimension registers (e.g., \the\dimenl69), glue registers (e.g., 
\the\skip255), or muglue registers (e.g., \the\muskip\count2). 

■ \the (codename) (8-bit number), where (codename) stands for either \catcode, 
\mathcode, \lccode, \uccode, \sfcode, or \delcode. For example, \the\mathcode ‘/ 
produces the current (integer) math code value for a slash. 

■ \the (special register), where (special register) is one of the integer quantities 
\prevgraf, \deadcycles, \insertpenalties, \inputlineno, \badness, or \parshape 
(denoting only the number of lines of \parshape); or one of the dimensions \pagetotal, 
\pagegoal, \pagestretch, \pagef ilstretch, \pagef illstretch, \pagef illlstretch, 
\pageshrink, \pagedepth. In horizontal modes you can also refer to a special integer, 
\the\spacefactor; in vertical modes there’s a special dimension, \the\prevdepth. 

■ \the\fontdimen(parameter number) (font). This produces a dimension; for 
example, parameter 6 of a font is its “em” value, so ‘\the\f ontdimen6\tenrm’ yields 
TO. Opt’ (six tokens). 

■ \the\hyphenchar(font), \the\skewchar(font) . These produce the correspond- 
ing integer values defined for the specified font. 

■ \the\lastpenalty, \the\lastkern, \the\lastskip. These yield the amount 
of penalty, kerning, glue, or muglue in the final item on the current list, provided that 
the item is a penalty, kern, or glue, respectively; otherwise they yield ‘0’ or ‘O.Opt’. 

■ \the (defined character), where (defined character) is a control sequence that 
has been given an integer value with \chardef or \mathchardef ; the result is that 
integer value, in decimal notation. 

/gb/gb °f th® cases listed so far, \the produces a result that is a sequence of 

JL JL ASCII character tokens. Category code 12 (“other”) is assigned to each token, 
except that character code 32 gets category 10 (“space”). The same rule is used to 
assign category codes to the tokens produced by \number, \romannumeral, \ string, 
\meaning, \jobname, and \fontname. 

/gb/gb There also are cases in which \the produces non-character tokens, either a 
JL JL font identifier like \tenrm, or an arbitrary token list: 

■ \the(font) produces a font identifier that selects the specified font. For exam- 
ple, ‘\the\font’ is a control sequence corresponding to the current font. 
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■ \the (token variable) produces a copy of the token list that is the current value 
of the variable. For example, you can expand ‘\the\everypar’ and ‘\the\toks5’. 

TjgX’s primitive command ‘\showthe’ will display on your terminal exactly 
what ‘\the’ would produce in an expanded definition; the expansion is pre- 
ceded by ‘> ’ and followed by a period. For example, ‘\showthe\parindent’ will display 

> 20. Opt. 

if the plain TjrX paragraph indentation is being used. 

Here now is the promised list of all cases when expandable tokens are not ex- 
JL JL panded. Some of the situations involve primitives that haven’t been discussed 
yet, but we’ll get to them eventually. Expansion is suppressed at the following times: 

■ When tokens are being deleted during error recovery (see Chapter 6). 

■ When tokens are being skipped because conditional text is being ignored. 

■ When TfrX is reading the arguments of a macro. 

■ When J^X is reading a control sequence to be defined by Met, \futurelet, 
\def, \gdef, \edef, \xdef, \chardef, \mathchardef , \countdef, \dimendef, 
\skipdef, \muskipdef, \toksdef, \read, and \font. 

■ When TjrX is reading argument tokens for \expandafter, \noexpand, \string, 
\meaning, \let, \futurelet, \ifx, \show, \afterassignment, \aftergroup. 

■ When T^X is absorbing the parameter text of a \def, \gdef, \edef , or \xdef. 

■ When TgX is absorbing the replacement text of a \def or \gdef or \read; 
or the text of a token variable like \everypar or \toksO; or the token list 
for \uppercase or Mowercase or \write. (The token list for \write will be 
expanded later, when it is actually output to a file.) 

■ When J)?X is reading the preamble of an alignment, except after a token for 
the primitive command \span or when reading the (glue) after \tabskip. 

■ Just after a $3 token that begins math mode, to see if another $3 follows. 

■ Just after a ‘12 token that begins an alphabetic constant. 

Sometimes you will find yourself wanting to define new macros whose replace- 
ment text has been expanded, based on current conditions, instead of simply 
copying the replacement text verbatim. TjrX provides the \edef (expanded definition) 
command for this purpose, and also \xdef (which is equivalent to \global\edef ). The 
general format is the same as for \def and \gdef , but TgX blindly expands the tokens 
of the replacement text according to the expansion rules above. For example, consider 

\def\double#l{#l#l} 

\edef \a{\double{xy}} \edef \a{\double\a} 

Here the first \edef is equivalent to ‘\def \a{xyxy}’ and the second is equivalent to 
l \def\a{xyxyxyxy}’. All of the other kinds of expansion will take place too, including 
conditionals; for example, 

\edef \b#l#2{\ifmmode#l\else#2\f i} 


215 


showthe 
error recovery 
let 

futurelet 

def 

gdef 

edef 

xdef 

chardef 

mathchardef 

countdef 

dimendef 

skipdef 

muskipdef 

toksdef 

read 

font 

expandafter 

noexpand 

string 

meaning 

let 

futurelet 

ifx 

show 

afterassignment 

aftergroup 

read 

token variable 

everypar 

toks 

uppercase 

lowercase 

write 

tabskip 

alphabetic constant 

edef 

xdef 


gives a result equivalent to l \def\b#l#2{#l}’ if T^X is in math mode at the time of 
the \edef, otherwise the result is equivalent to ‘\def \b#l#2{#2}’. 
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/>)/>) Expanded definitions that are made with \edef or \xdef continue to expand 
JL JL tokens until only unexpandable tokens remain, except that token lists pro- 
duced by ‘\the’ are not expanded further. Furthermore a token following ‘\noexpand’ 
will not be expanded, since its ability to expand has been nullified. These two opera- 
tions can be used to control what gets expanded and what doesn’t. 

/>)/>) Suppose, for example, that you want to define \a to be equal to \b (expanded) 
JL JL followed by \c (not expanded) followed by \d (expanded), assuming that \b 
and \d are simple macros without parameters. There are two easy ways to do it: 


\edef \a{\b\noexpand\c\d} 

\toksO={\c} \edef\a{\b\the\toksO \d} 


And it’s even possible to achieve the same effect without using either \noexpand or \the; 
a reader who wants to learn more about TJrX’s expansion mechanism is encouraged to 
try the next three exercises. 

&)&)*■ EXERCISE 20.14 

JL JL Figure out a way to define \a as in the previous paragraph, without using 
l^X’s primitives ‘\noexpand’ and ‘\the’. 

^►EXERCISE 20.15 

Continuing the example of expansion avoidance, suppose that you want to 
expand \b completely until only unexpandable tokens are left, but you don’t want 
to expand \c at all, and you want to expand \d only one level. For example, after 
\def\b{\c\c} and \def\c{*} and \def\d{\b\c} the goal would be to get the effect of 
\def\a{**\c\b\c}. How can such a partial expansion be achieved, using \the? 

&)&)*■ EXERCISE 20.16 

JL JL Solve the previous exercise without \the or \noexpand. (This is difficult.) 

/>)/>) TfrX’s primitive commands \mark{. . .}, \message{. . .}, \errmessage{ . . .}, 
JL JL \special{ . . . }, and \write(number){ . . . } all expand the token lists in braces 
almost exactly as \edef and \xdef do. However, a macro parameter character like # 
should not be duplicated in such commands; you need to say ## within an \edef, but 
only # within a \mark. The \write command is somewhat special, because its token 
list is first read without expansion; expansion occurs later, when the tokens are actually 
being written to a file. 

EXERCISE 20.17 

Compare the following two definitions: 

\def\a{\iftrue{\else}\f i} 

\edef \b{\iftrue{\else}\f i} 


Which of them yields an unmatched left brace? (This is tricky.) 

(&)&) TpX has the ability to read individual lines of text from up to 16 files at once, 
JL JL in addition to the files that are being \ input. To initiate reading such an 
auxiliary file, you should say 


\openin(number)=(file name) 


where the (number) is between 0 and 15. (Plain TgX allocates input stream numbers 
0 through 15 with the \newread command, which is analogous to \newbox.) In most 
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installations of TjjX; the extension 1 . tex’ will be appended to the file name, as with 
\input, if no extension is given explicitly. If the file cannot be found, TpX will give no 
error message; it will simply consider that the input, stream is not open, and you can 
test this condition with \ifeof. When you’re done with a file, you can say 

\closein(number) 

and the file associated with that input stream number will be closed, i.e., returned to 
its initial condition, if such a file was open. To get input from an open file, you say 

\read{number)to (control sequence) 

and the control sequence is defined to be a parameterless macro whose replacement 
text is the contents of the next line read from the designated file. This line is converted 
to a token list, using the procedure of Chapter 8, based on the current category codes. 
Additional lines are read, if necessary, until an equal number of left and right braces 
has been found. An empty line is implicitly appended to the end of a file that is being 
\read. If the (number) is not between 0 and 15, or if no such file is open, or if the 
file has ended, input will be from the terminal; T|gX will prompt the user unless the 
(number) is negative. The macro definition will be local unless you say \global\read. 

/gb/gb For example, it’s easy to have dialogs with the user, by using \read together 
JL JL with the \message command (which writes an expanded token list on the 
terminal and in the log file): 

\message{Please type your name:} 

\readl6 to\myname 
\message{Hello , \myname ! } 

The \read command in this case will print ‘\myname=’ and it will wait for a response; 
the response will be echoed on the log file. The ‘\myname=’ would have been omitted if 
‘\readl6’ had been ‘\read-l’. 

(£)(£)>■ EXERCISE 20.18 

JL JL The \myname example just given doesn’t work quite right, because the (return) 
at the end of the line gets translated into a space. Figure out how to fix that glitch. 

(£)(£)*■ EXERCISE 20.19 

JL JL Continuing the previous example, define a macro \MYNAME that contains the 
letters of \myname all in uppercase letters. For example, if \myname expands to Arthur, 
\MYNAME should expand to ARTHUR. Assume that \myname contains only letters and 
spaces in its expansion. 

/>)/>) Appendices B, D, and E contain numerous examples of how to make macros 
JL JL do useful things. Let’s close this chapter by presenting a few examples that 
show how TgX can actually be used as a primitive programming language, if you want 
to achieve special effects, and if you don’t care very much about computer costs. 

/gb/gb Plain T|.;X contains a \loop. . Arepeat construction, which works like this: 
JL JL You say ‘\loop a \if . . . j3 \repeat’, where a and p are any sequences of 
commands, and where \if . . . is any conditional test (without a matching \fi). TpX 
will first do a; then if the condition is true, T}rX will do p and repeat the whole process 
again starting with a. If the condition ever turns out to be false, the loop will stop. 
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For example, here is a program that carries out a little dialog in which TjXX waits for 
the user to type ‘Yes’ or ‘No’: 

\def\yes{Yes } \def\no{No } \newif\ifgarbage 
\loop\message{Are you happy? } 

\read-l to\answer 

\ifx\answer\yes\garbagefalse '/, the answer is Yes 

\else\ifx\answer\no\garbagef alse '/, the answer is No 
\else\garbagetrue\f i\f i '/, the answer is garbage 
\ifgarbage\message{ (Please type Yes or No.)} 

\repeat 

EXERCISE 20.20 

JL JL Use the \loop. . . \repeat mechanism to construct a general \punishment 
macro that repeats any given paragraph any given number of times. For example, 

\punishment{I must not talk in class. }{100} 

should produce the results desired in exercise 20.1. 

The first thirty prime numbers are 2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 
XX 41, 43, 47, 53, 59, 61, 67, 71, 73, 79, 83, 89, 97, 101, 103, 107, 109, and 113. 
You may not find this fact very startling; but you may be surprised to learn that the 
previous sentence was typeset by saying 

The first thirty prime numbers are \primes{30}. 

TjgX did all of the calculation by expanding the \primes macro, so the author is pretty 
sure that the list of prime numbers given above is quite free of typographic errors. Here 
is the set of macros that did it: 

\newif\ifprime \newif\ifunknown '/, boolean variables 
\newcount\n \newcount\p \newcount\d \newcount\a '/, integer variables 
\def\primes#l{2 , ~3*/, assume that #1 is at least 3 
\n=#l \advance\n by-2 '/, n more to go 
\p=5 */, odd primes starting with p 

\loop\ifnum\n>0 \printifprime\advance\p by2 \repeat} 

\def\printp{ , '/, we will invoke \printp if p is prime 
\ifnum\n=l and~\fi / ‘and’ precedes the last value 
\number\p \advance\n by -1 } 

\def\printifprime{\testprimality \ifprime\printp\f i} 
\def\testprimality{{\d=3 \global\primetrue 

\loop\trialdivision \ifunknown\advance\d by2 \repeat}} 
\def\trialdivision{\a=\p \divide\a by\d 

\ifnum\a>\d \unknowntrue\else\unknownf alse\f i 
\multiply\a by\d 

\ifnum\a=\p \global\primef alse\unknownfalse\f i} 

The computation is fairly straightforward, except that it involves a loop inside a loop; 
therefore \testprimality introduces an extra set of braces, to keep the inner loop 
control from interfering with the outer loop. The braces make it necessary to say 
‘\global’ when \ifprime is being set true or false. TpX spent more time constructing 
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that sentence than it usually spends on an entire page; the \trialdivision macro was 
expanded 132 times. 


The \loop macro that does all these wonderful things is actually quite simple. 
JL JL It puts the code that’s supposed to be repeated into a control sequence called 
\body, and then another control sequence iterates until the condition is false: 


\def\loop#l\repeat{\def \body{#l}\iterate} 

\def\iterate{\body\let\next=\iterate\else\let\next=\relax\f i\next} 


The expansion of \iterate ends with the expansion of \next; therefore l^jX is able 
to remove \iterate from its memory before invoking \next, and the memory does not 
fill up during a long loop. Computer scientists call this “tail recursion.” 


/>>/>) The \hex macro below, which converts count register \n to hexadecimal no- 
JL JL tation, illustrates a recursive control structure in which many copies of \hex 
can be active simultaneously. Recursion works better than simple \loop iteration in 
this application because the hexadecimal digits are discovered from right to left, while 
they must be output from left to right. (The number in \n should be >0.) 


\def\hex{{\countO=\n \divide\n byl6 

\ifmim\n>0 \hex\fi \count2=\n \multiply\count2 by-16 
\advance\countO by\count2 \hexdigit}} 
\def\hexdigit{\ifnum\count0<10 \number\countO 

\else\advance\countO by-10 \advance\countO by ‘A \char\countO \fi} 

/>^/>) Our final example is a macro that computes the number of nonblank tokens 
JL JL in its argument; for example, ‘\length{argument}’ expands to ‘8’. This illus- 
trates yet another aspect of macro technique. 

\def\length#l{{\count0=0 \getlength#l\end \number\countO}} 
\def\getlength#l{\ifx#l\end \let\next=\relax 

\else\advance\countO byl \let\next=\getlength\f i \next} 
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By this time [37 A.D.] the influence of Macro had become supreme. 

— TACITUS, Annals (c. 120 A.D.) 


I hate definitions. 
— BENJAMIN DISRAELI, Vivian Grey (1826) 
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In Chapters 11 and 12 we discussed the principles of boxes and glue, and by 
now we have seen many applications of those concepts. You can get by in most 
cases with the boxes that T^X manufactures automatically with its paragraph 
builder, its page builder, and its math formula processor; but if you want to do 
nonstandard things, you have the option of making boxes by yourself. For exam- 
ple, Chapter 14 points out that you can keep something from being hyphenated 
or split between lines if you enclose it in an \hbox; Chapter 19 points out that 
\hbox allows you to get ordinary text into a displayed equation. 

f Tlie purpose of the present chapter is to nail down whatever details about 
boxes haven’t been covered yet. Fortunately, there isn’t much more to discuss; 
we have already mentioned most of the rules, so this chapter is fairly short. In fact, 
the previous chapters have dealt with almost everything except the rules about rules. 


f To make a rule box (i.e., a solid black rectangle), you type ‘\hrule’ in vertical 
mode or ‘\vrule’ in horizontal mode, followed by any or all of the specifications 
‘width(dimen)’, ‘height(dimen)’, ‘depth(dimen)’, in any order. For example, if 


\vrule height4pt width3pt depth2pt 

appears in the middle of a paragraph, TyX will typeset the black box 1 1 ’. If you specify 
a dimension twice, the second specification overrules the first. If you leave a dimension 
unspecified, you get the following by default: 



\hrule 

\vrule 

width 

* 

0.4 pt 

height 

0.4 pt 

* 

depth 

0.0 pt 

* 


Here means that the actual dimension depends on the context; the rule will extend 
to the boundary of the smallest box or alignment that encloses it. 


hbox 

vbox 

rule box 

black rectangle 

hrule 

vrule 

width 

height 

depth 

interline glue 
User 

moveright 


f For example, the author typed ‘\hrule’ just before typing this paragraph, 
and you can see what happened: A horizontal rule, 0.4 pt thick, was extended 
across the page, because the vertical box that encloses it turned out to be just that 
wide. (In fact, the vertical box that encloses it is the page itself.) Another example 
appears immediately after this paragraph, where you can see the result of 

\hrule width5cm heightlpt \vskiplpt \hrule width6cm 

IjrX does not put interline glue between rule boxes and their neighbors in a vertical 
list, so these two rules are exactly 1 pt apart. 


► EXERCISE 21.1 

B. L. User didn’t want one of his horizontal rules to touch the left margin, so 
he put it in a box and moved it right, like this: 

\moveright lin \vbox{\hrule width3in} 

But he found that this produced more space above and below the rule than when he 
had simply said ‘\hrule width 4in’ with no \vbox. Why did l^rX insert more space, 
and what should he have done to avoid it? 
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f lf you specify all three dimensions of a rule, there’s no essential difference 
between \hrule and \vrule, since both will produce exactly the same black 
box. But you must call it an \hrule if you want to put it in a vertical list, and you 
must call it a \vrule if you want to put it in a horizontal list, regardless of whether it 
actually looks like a horizontal rule or a vertical rule or neither. If you say \vrule in 
vertical mode, T{rX starts a new paragraph; if you say \hrule in horizontal mode, TgX 
stops the current paragraph and returns to vertical mode. 

f The dimensions of a rule can be negative; for example, here’s a rule whose 

height is 3pt and whose depth is — 2pt: 1 ’. However, a rule 

is invisible unless its height plus depth is positive and its width is positive. A rule 
whose width is negative cannot be seen, but it acts like a backspace when it appears 
in a horizontal list. 


/■£> ►EXERCISE 21.2 

JL Explain how the author probably obtained the rule 1 
previous paragraph. [Hint: It’s one inch long.] 


in the 


/gK/gK Now let’s summarize all of the ways there are to specify boxes explicitly to 
JL JL TfrX. (1) A character by itself makes a character box, in horizontal mode; 
this character is taken from the current font. (2) The commands \hrule and \vrule 
make rule boxes, as just explained. (3) Otherwise you can make liboxes and vboxes, 
which fall under the generic term (box). A (box) has one of the following seven forms: 


\hbox{box specification) {{horizontal material)} 
\vbox(box specification) {{vertical material)} 
\vtop{box specification) {{vertical material)} 
\box{register number) 

\ copy {register number) 

\vspl it {register number)to{dimen) 

\lastbox 


(see Chapter 12) 
(see Chapter 12) 
(see Chapter 12) 
(see Chapter 15) 
(see Chapter 15) 
(see Chapter 15) 
(see Chapter 21) 


Here a {box specification) is either ‘to{dimen)’ or ‘spread{dimen)’ or empty; this gov- 
erns the setting of glue in the horizontal or vertical lists inside the box, as explained 
in Chapter 12. A {register number) is between 0 and 255; after you say \box, that 
register becomes void, but after \copy the register is unchanged, as explained in Chap- 
ter 15. The \vsplit operation is also explained in Chapter 15. In math modes an 
additional type of box is available: \vcenter{box specification) {{vertical material)} 
(see Chapter 17). 


/gL/gL The bottom line of the table above refers to \lastbox, a primitive operation 
JL JL that hasn’t been mentioned before. If the last item on the current horizontal 
list or vertical list is an libox or vbox, it is removed from the list and it becomes the 
\lastbox; otherwise \lastbox is void. This operation is allowed in internal vertical 
mode, horizontal mode, and restricted horizontal mode, but you cannot use it to take a 
box from the current page in vertical mode. In math modes, \lastbox is always void. 
At the beginning of a paragraph, ‘{\setboxO=\lastbox}’ removes the indentation box. 

/gL/gL The operation \unskip is something like \lastbox, except that it applies to 
JL JL glue instead of to boxes. If the last thing on the current list is a glue item (or 
leaders, as explained below), it is removed. You can’t remove glue from the current 
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page by using \unskip in vertical mode, but. you can say ‘\vskip-\lastskip’, which 
has almost the same effect. 


Chapters 24 to 26 present summaries of all TjrX’s operations in all modes, 
and when those summaries mention a ‘{box}’ they mean one of the seven 
possibilities just listed. For example, you can say ‘\setbox{register number)=(box)’ in 
any mode, and you can say l \moveright(dimen) (box)’ in vertical modes. But you can’t 
say ‘\setbox(register number)=C’ or ‘\moveright(dimen)\hrule’; if you try either of 
these, TfrX will complain that a (box) was supposed to be present. Characters and 
rules are so special, they aren’t regarded as (box)es. 

^►EXERCISE 21.3 

Define a control sequence \boxit so that ‘\boxit{{box}}’ yields the given box 
surrounded by 3 points of space and by ruled lines on all four sides. 


For example, the sentence you are now reading was typeset as part of 
the displayed formula $$\boxit{\boxit{\box4}}$$, where box 4 was 
created by typing l \setbox4=\vbox{\hsize 23pc \noindent \strut 
For example, the sentence you are now reading ... \strut} 1 . 


f Let’s look also at what can go inside a box. An libox contains a horizontal 
list; a vbox contains a vertical list. Both kinds of lists are made up primarily 
of things like boxes, glue, kerns, and penalties, as we have seen in Chapters 14 and 15. 
But you can also include some special things that we haven’t discussed yet, namely 
“leaders” and “whatsits.” Our goal in the rest of this chapter will be to study how to 
make use of such exotic items. 


f The dots you see before your eyes here are called leaders 

because they lead your eyes across the page; such things are often used in 
indexes or tables of contents. The general idea is to repeat a box as many times as 
necessary to fill up some given space. TjrX treats leaders as a special case of glue; 
no, wait, it’s the other way around: TjrX treats glue as a special case of leaders. 
Ordinary glue fills space with nothing, while leaders fill space with any desired thing. 
In horizontal mode you can say 

\leaders(box or rule) \hskip (glue) 

and the effect will be the same as if you had said just l \hskip(glue)’, except that the 
space will be occupied by copies of the specified (box or rule). The glue stretches or 
shrinks in the usual way. For example, 

\def\leaderf ill{\leaders\hbox to lem{\hss . \hss}\hf ill} 
\line{Alpha\leaderf ill Omega} 

\line{The Beginning\leaderf ill The Ending} 


will produce the following two lines: 

Alpha 

The Beginning 


. . Omega 
The Ending 


lastskip 

leaders 

leaders 


Here ‘\hbox to lem{\hss . \hss}’ specifies a box one em wide, with a period in its 
center; the control sequence \leaderfill then causes this box to be replicated when 
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filling space in the Mine box. (Plain TjjX’s Mine macro makes an hbox whose width 
is the \hsize.) 

f Notice that the dots in the two example lines appear exactly above each other. 

This is not a coincidence; it’s a consequence of the fact that the Meaders 
operation acts something like a window that lets you see part of an infinite row of 
boxes. If the words ‘Alpha’ and ‘Omega’ are replaced by longer words, the number of 
dots might be different but the ones that you see will be in the same places as before. 
The infinitely replicated boxes are lined up so that they touch each other, and so that, 
if you could see them all, one of them would have the same reference point as the 
smallest enclosing box. Thus, Meaders will put a box flush with the left edge of an 
enclosing box, if you start the leaders there; but you won’t get a box flush right unless 
the width of the enclosing box is exactly divisible by the width of the repeated box. 
If the repeated box has width w, and if the space to be filled is at least 2w, then you 
will always see at least one copy of the box; but if the space is less than 2 w the box 
may not appear, because boxes in the infinite row are typeset only when their entire 
width falls into the available space. 

(&)&) When leaders are isolated from each other, you might not want them to be 
JL JL aligned as just described, so TgX also provides for nonaligned leaders. In this 
case a box of width w will be copied q times when the space to be filled is at least qw 
and less than ( q + 1 )w; furthermore, the results will be centered in the available space. 
There are two kinds of nonaligned leaders in Tj?X, namely \cleaders (centered leaders) 
and \xleaders (expanded leaders). Centered leaders pack the boxes tightly next to 
each other leaving equal amounts of blank space at the left and right; expanded leaders 
distribute the extra space equally between the q + 1 positions adjacent to the q boxes. 
For example, let’s suppose that a 10 pt-wide box is being used in leaders that are 
supposed to fill 56 pt of space. Five copies of the box will be used; \cleaders will 
produce 3pt of space, then the five boxes, then another 3pt of space. But \xleaders 
will produce 1 pt of space, then the first box, then another 1 pt of space, then the second 
box, . . . , then the fifth box, and 1 pt of space. 

(£)<£)>■ EXERCISE 21.4 

JL JL Suppose that a 10 pt-wide box is to fill 38 pt of space starting 91 pt from the 
left of its enclosing box. How many copies of the box will be produced by Meaders, 
\cleaders, and \xleaders? Where will the boxes be positioned, relative to the left 
edge of the enclosing box, in each of the three cases? 

(£)(£)*■ EXERCISE 21.5 

JL JL Assuming that the ‘ . ’ in the Meaderf ill macro on the previous page is only 
0.2 em wide, there is 0.4 ern of blank space at both sides of the one-em box. Therefore 
the Meaders construction will leave between 0.4 em and 1.4 em of blank space between 
the periods and the text at either end. Redefine Meaderfill so that the amount of 
blank space at either end will be between 0.1 em and 1.1 em, but the leaders on adjacent 
lines will still be aligned with each other. 

f Instead of giving a (box) in the leaders construction, you can give a (rule), 
which means either \hrule or \vrule, followed by optional height, width, 
and depth specifications as usual. The rule will then be made as wide as the corre- 
sponding (glue). This is a case where \hrule makes sense in horizontal mode, because 


line 

dots 
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depth 
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it gives a horizontal rule in text. For example, if the \leaderfill macro in our earlier 
illustration is changed to 

\def\leaderf ill{ \leaders\hrule\hf ill\ } 

then the results look like this: 

Alpha Omega 

The Beginning The Ending 


When a rule is used instead of a box, it fills the space completely, so there’s no difference 
between \leaders, \cleaders, and \xleaders. 

EXERCISE 21.6 

JL JL What does \leaders\vrule\hfill produce? 

($>')($>') Leaders work in vertical mode as well as in horizontal mode. In this case 
JL JL vertical glue (e.g., \vskip{glue) or \vfill) is used instead of horizontal glue, 
and \leaders produces boxes that are aligned so that the top of each repeated box 
has the same vertical position as the top of the smallest enclosing box, plus a multiple 
of the height-plus-depth of the repeated box. No interlineskip glue is placed between 
boxes in vertical leaders: the boxes are just stacked right on top of each other. 

&)(&) If you specify horizontal leaders with a box whose width isn’t positive, or if 
JL JL you specify vertical leaders with a box whose height-plus-depth isn’t positive, 
TjgX silently ignores the leaders and produces ordinary glue instead. 

(£)(£)*■ EXERCISE 21.7 

JL JL Explain how you can end a paragraph with a rule that is at least 10 pt long 
and extends all the way to the right margin, like this: 

/gb/gb Horizontal leaders differ slightly from horizontal glue, because they have height 
JL JL and depth when TjgX calculates the size of the enclosing box (even though the 
number of replications might be zero). Similarly, vertical leaders have width. 

(£)(£)*■ EXERCISE 21.8 

JL JL Demonstrate how to produce the following ‘TgXture’ 


T^T^T^T^XT^XTeXTeXTeXTeXT^X 

T^T^T^T^T^T^T^T^T^TftX 


by using vertical leaders inside of horizontal leaders. (The TgX logo lias been put into 
a rectangular box, and copies of this box have been packed together tightly.) 



► EXERCISE 21.9 

Use vertical leaders to solve exercise 20.1. 


The \overbrace and \underbrace macros of plain TjgX are constructed by 
combining characters with rules. Font cmexlO contains four symbols x. ^ 
each of which lias depth zero and height equal to the thickness of a rule that joins them 
properly. Therefore it’s easy to define \upbracefill and \downbracef ill macros so 
that you can obtain, e.g., 




horizontal rule in text 

TeX logo 

overbrace 

underbrace 

cmexlO 

upbracefill 

downbracefill 

braces 
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by saying ‘\hbox to 100pt{\downbracef ill}\hbox to 50pt{\upbracef ill}’ in ver- 
tical mode. Details of those macro definitions appear in Appendix B. 

/>)/>) The definition of \overrightarrow in Appendix B is more complex than that 
X X of \overbrace, because it involves a box instead of a rule. The fonts of plain 
T)gX are designed so that symbols like <— and — > can be extended with minus signs; 
similarly, -4= and => can be extended with equals signs. However, you can’t simply put 

the characters next to each other, because that leaves gaps (‘« ’ and “<===’); it is 

necessary to backspace a little between characters. An additional complication arises 
because the extension line in a long arrow might need to be some non-integer number 
of minus signs long. To solve this problem, the \rightarrowf ill macro in Appendix B 
uses \cleaders with a repeatable box consisting of the middle 10 units of a minus sign, 
where one unit is pg em. The leaders are preceded and followed by — and — there’s 
enough backspacing to compensate for up to 5 units of extra space, fore and aft, that 
\cleaders might leave blank. In this way a macro is obtained such that 

\hbox to 100pt{\rightarrowf ill} 

yields 1 »’. 

Now we know all about leaders. What about wlratsits? Well, whatsits have 
JL JL been provided as a general mechanism by which important special printing 
applications can be handled as extensions to T)gX. It’s possible for system wizards to 
modify the T)gX program, without changing too much of the code, so that new features 
can be accommodated at high speed instead of encoding them in macros. The author 
hopes that such extensions will not be made very often, because he doesn’t want incom- 
patible pseudo- T^gX systems to proliferate; yet he realizes that certain special books 
deserve a special treatment. Whatsits make it possible to incorporate new things into 
boxes without bending the existing conventions too much. But they make applications 
less portable from one machine to another. 

&)(&) Two kinds of whatsits are defined as part of all l)gX implementations. They 
JL JL aren’t really extensions to T)gX, but they are coded as if they were, so that 
they provide a model of how other extensions could be made. The first of these is con- 
nected with output to text files, and it involves the l)gX primitive commands \openout, 
\closeout, \write, and \immediate. The second is connected with special instructions 
that can be transmitted to printing devices, via TfgX’s \special command. 

The ability to write text files that can later be input by other programs (includ- 
JL JL ing T[gX) makes it possible to take care of tables of contents, indexes, and many 
other things. You can say ‘\openout(number) = {file name)’ and ‘\closeout{number}’ 
by analogy with the \openin and \closein commands of Chapter 20; the (number) 
must be between 0 and 15. The filename is usually extended with ‘.tei’ if it has no 
extension. There is a Write command that writes one line to a file, analogous to the 
\read command that reads one line; you say 

\write(number){(token list)} 


overrightarrow 
minus signs 
arrows 
equals signs 
right arrow fill 
whatsits 

extensions to Tf^X 

openout 

closeout 

write 

immediate 
special 
.tex 
log file 
terminal 


and the material goes out to the file that corresponds to the given stream number. 
If the (number) is negative or greater than 15, or if the specified stream has no file 
open for output, the output goes to the user’s log file, and to the terminal unless the 



Chapter 21: Making Boxes 


227 


number is negative. Plain TfrX has a \newwrite command that allocates output stream 
numbers from 0 to 15. Output streams are completely independent of input streams. 


/>)/>) However, the output actually takes place in a delayed fashion; the \openout, 
JL JL \closeout, and \write commands that you give are not performed when TfrX 
sees them. Instead, IjrX puts these commands into whatsit items, and places them 
into the current horizontal or vertical or math list that is being built. No actual 
output will occur until this whatsit is eventually shipped out to the dvi file, as part 
of a larger box. The reason for this delay is that Write is often used to make an 
index or table of contents, and the exact page on which a particular item will appear 
is generally unknown when the Write instruction occurs in mid-paragraph. TjjX is 
usually working ahead, reading an entire paragraph before breaking it into lines, and 
accumulating more than enough lines to fill a page before deciding what goes on the 
page, as explained in Chapters 14 and 15. Therefore a deferred writing mechanism is 
the only safe way to ensure the validity of page number references. 

The (token list) of a Write command is first stored in a whatsit without per- 
JL JL forming any macro expansion; the macro expansion takes place later, when 
TjtX is in the middle of a \shipout operation. For example, suppose that some para- 
graph in your document contains the text 


... For \write\inx{example : \the\countO}example , suppose ... 

Then the horizontal list for the paragraph will have a whatsit just before the word 
‘example’, and just after the interword space following ‘For’. This whatsit item con- 
tains the unexpanded token list ‘example: \the\countO’. It sits dormant while the 
paragraph is being broken into lines and put on the current page. Let’s suppose that 
this word ‘example’ (or some hyphenated initial part of it, like ‘ex-’) is shipped out on 
page 256. Then TjjX will write the line 


example : 256 


on output stream \inx, because the ‘\the\countO’ will be expanded at that time. 
Of course, Write commands are usually generated by macros; they are rarely typed 
explicitly in mid-paragraph. 


/>)/>) TpX defers \openout and \closeout commands by putting them into wlratsits 
JL JL too; thus, the relative order of output commands will be preserved, unless 
boxes are shipped out in some other order due to insertions or such things. 


/>)/>) Sometimes you don’t want TjrX to defer a Write or \openout or \closeout. 
JL JL You could say, e.g., ‘\shipout\hbox{\write. . .}’, but that would put an un- 
wanted empty page in your dvi file. So IjrX has another feature that gets around this 
problem: If you type ‘\immediate’ just before Write or \openout or \closeout, the 
operation will be performed immediately, and no whatsit will be made. For example, 


\immediate\writel6{Goodbye} 

prints ‘Goodbye’ on your terminal. Without the \immediate, you wouldn’t see the 
‘Goodbye’ until the current list was output. (In fact, you might never see it; or you 
may see it more than once, if the current list goes into a box that was copied.) An 
‘\immediate\writel6’ differs from \message in that Write prints the text on a line 
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by itself; the results of several \message commands might appear on the same line, 
separated by spaces. 

The (token list) of a \write ought to be rather short, since it makes one line 
JL JL of output. Some implementations of TjrX are unable to write long lines; if you 
want to write a lot of stuff, just give several \ write commands. Alternatively, you can 
set TjjX’s \newlinechar parameter to the ASCII code number of some character that 
you want to stand for “begin a new line” ; then TgX will begin a new line whenever it 
would ordinarily output that character to a file. For example, one way to output two 
lines to the terminal in a single \write command is to say 

\newl inechar= ‘ \ “ J 
\immediate\writel6{Two~~ Jlines . } 


Each \write command produces output in the form that TjjX always uses to dis- 
play token lists symbolically: Characters represent themselves (except that you get 
duplicated characters like ## for macro parameter characters); unexpandable control 
sequence tokens produce their names, preceded by the \escapechar and followed by 
a space (unless the name is an active character or a control sequence formed from a 
single nonletter). 



TgX ignores \write, \openout, and \closeout whatsits that appear within 
boxes governed by leaders. If you are upset about this, you shouldn’t be. 


/>)/>) Since the (token list) of a deferred \write is expanded at a fairly random time 
JL JL (when \shipout occurs), you should be careful about what control sequences 
it is allowed to contain. The techniques of Chapter 20 for controlling macro expansion 
often come in handy with respect to \ write. 


EXERCISE 21.10 

JL JL Suppose that you want to \write a token list that involves a macro \chapno, 
containing the current chapter number, as well as ‘\the\countO’ which refers to the 
current page. You want \chapno to be expanded immediately, because it might, change 
before the token list is written; but you want \the\countO to be expanded at the time 
of \shipout. How can you manage this? 

/>)/>) Now let’s wrap up our study of boxes by considering one more feature. The 
JL JL command ‘\special{(token list)}’ can be given in any mode. Like \write, 
it puts its token list into a whatsit; and like \message, it expands the token list im- 
mediately. This token list will be output to the dvi file with the other typesetting 
commands that TjjX produces. Therefore it is implicitly associated with a particular 
position on the page, namely the reference point that would have been present if a box 
of height, depth, and width zero had appeared in place of the whatsit. The (token list) 
in a \special command should consist of a keyword followed if necessary by a space 
and appropriate arguments. For example, 


\special{halftone picl} 


might mean that a picture on file picl should be inserted on the current page, with 
its reference point at the current position. TjrX doesn’t look at the token list to see 
if it makes any sense; the list is simply copied to the output. However, you should 
be careful not to make the list too long, or you might overflow TjjX’s string memory. 
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The \special command enables you to make use of special equipment that might be 
available to you, e.g., for printing books in glorious TgXnicolor. 

Software programs that convert dvi files to printed or displayed output should 
JL JL be able to fail gracefully when they don’t recognize your special keywords. 
Thus, \special operations should never do anything that changes the current position. 
Whenever you use \special, you are taking a chance that your output file will not be 
printable on all output devices, because all \special functions are extensions to TJrX. 
However, the author anticipates that certain standards for common graphic operations 
will emerge in the TjgX user community, after careful experiments have been made by 
different groups of people; then there will be a chance for some uniformity in the use 
of \special extensions. 



TjrX will report the badness of glue setting in a box if you ask for the numeric 
quantity \badness after making a box. For example, you might say 


\setboxO=\line{\trialtexta} 

\ifnum\badness>250 \setboxO=\line{\trialtextb}\f i 


The badness is between 0 and 10000 unless the box is overfull, when \badness=1000000. 


color 

badness 

BARROUGH 

SHAW 


If age or weaknes doe prohibyte bloudletting, 
you must use boxing. 
— PHILIP BARROUGH, The Methode of Phisicke (1583) 

The only thing that never looks right is a rule. 
There is not in existence a page with a rule on it 
that cannot be instantly and obviously improved 
by taking the rule out. 
— GEORGE BERNARD SHAW, in The Dolphin (1940) 
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Printers charge extra when you ask them to typeset tables, and they do so for 
good reason: Each table tends to have its own peculiarities, so it’s necessary 
to give some thought to each one, and to fiddle with alternative approaches 
until finding something that looks good and communicates well. However, you 
needn’t be too frightened of doing tables with TgX, since plain TgX has a “tab” 
feature that handles simple situations pretty much like you would do them on a 
typewriter. Furthermore, TgX has a powerful alignment mechanism that makes 
it possible to cope with extremely complex tabular arrangements. Simple cases 
of these alignment operations will suffice for the vast majority of applications. 

Let’s consider tabbing first. If you say ‘\settabs n \columns’, plain 
TgX makes it easy to produce lines that are divided into n equal-size columns. 

Each line is specified by typing 

\+{texti)&{texto)& • • • \cr 

where (texti) will start flush with the left margin, (texto) will start at the left of 
the second column, and so on. Notice that ‘\+’ starts the line. The final column 
is followed by ‘\cr’, which old-timers will recognize as an abbreviation for the 
“carriage return” operation on typewriters that had carriages. For example, 
consider the following specification: 

\settabs 4 \columns 

\+&&Text that starts in the third column\cr 
\+&Text that starts in the second column\cr 
\+\it Text that starts in the first column, and&&& 
the fourth, and&beyond ! \cr 

After ‘\settabs4\columns’ each \+ line is divided into quarters, so the result is 

: : Text that starts in the third column : 

: Text that starts in the second column : : 

'.Text that starts in the first column, and the fourth, and beyond! 

This example merits careful study because it illustrates several things. 

(1) The is like the TAB key on many typewriters; it tells TpX to advance 
to the next tab position, where there’s a tab at the right edge of each column. 

In this example, TgX has set up four tabs, indicated by the dashed lines; a 
dashed line is also shown at the left margin, although there isn’t really a tab 
there. (2) But isn’t exactly like a mechanical typewriter TAB, because it first 
backs up to the beginning of the current column before advancing to the next. 

In this way you can always tell what column you’re tabbing to, by counting 
the number of &’s; that’s handy, because variable-width type otherwise makes 
it difficult to know whether you’ve passed a tab position or not. Thus, on the 
last line of our example, three &’s were typed in order to get to column 4, even 
though the text had already extended into column 2 and perhaps into column 3. 

(3) You can say ‘\cr’ before you have specified a complete set of columns, if the 
remaining columns are blank. (4) The &’s are different from tabs in another way, 
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tabbing 

settabs 

columns 

columns 

+ 

cr 

carriage return 
ampersand 
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too: TpX ignores spaces after “&’, hence you can conveniently finish a column 
by typing “&’ at the end of a line in your input file, without worrying that an 
extra blank space will be introduced there. (The second-last line of the example 
ends with ‘&’, and there is an implicit blank space following that symbol; if Tj^X 
hadn’t ignored that space, the words ‘the fourth’ wouldn’t have started exactly 
at the beginning of the fourth column.) Incidentally, plain T^X also ignores 
spaces after ‘\+’, so that the first column is treated like the others. (5) The 
‘\it’ in the last line of the example causes only the first column to be italicized, 
even though no braces were used to confine the range of italics, because Tj^X 
implicitly inserts braces around each individual entry of an alignment. 

f Once you have issued a \settabs command, the tabs remain set until you 
reset them, even though you go ahead and type ordinary paragraphs as usual. 
But if you enclose \settabs in {...}, the tabs defined inside a group don’t affect the 
tabs outside; ‘\global\settabs’ is not permitted. 


f Tabbed lines usually are used between paragraphs, in the same places where 
you would type Mine or \centerline to get lines with a special format. But 
it’s also useful to put \+ lines inside a \vbox; this makes it convenient to specify displays 
that contain aligned material. For example, if you type 


$$\vbox{\settabs 3 \columns 

\+This is&a strange&example\cr 

\+of displayed&three-column&f ormat . \cr}$$ 


you get the following display: 

This is a strange example 

of displayed three-column format. 


In this case the first column doesn’t appear flush left, because TjgX centers a box that 
is being displayed. Columns that end with \cr in a \+ line are put into a box with 
their natural width; so the first and second columns here are one-third of the \hsize, 
but the third column is only as wide as the word ‘example’. We have used $$ in this 
construction even though no mathematics is involved, because $$ does other useful 
things; for example, it centers the box, and it inserts space above and below. 

People don’t always want tabs to be equally spaced, so there’s another 
way to set them, by typing ‘\+{sample line)\cr’ immediately after ‘\settabs’. 
In this case tabs are placed at the positions of the &’s in the sample line, and 
the sample line itself does not appear in the output. For example, 

\settabs\+\indent&Horizontal lists\quad&\cr 7 sample line 
\+&Horizontal lists&Chapter 14\cr 
\+&Vertical lists&Chapter 15\cr 
\+&Math lists&Chapter 17\cr 


causes TgX to typeset the following three lines of material: 

Horizontal lists Chapter 14 
Vertical lists Chapter 15 
Math lists Chapter 17 


spaces 

it 

braces 

global 

line 

centerline 
displays 
dollardollar 
sample line 
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The \settabs command in this example makes column 1 as wide as a paragraph 
indentation; and column 2 is as wide as ‘Horizontal lists’ plus one quad of space. 
Only two tabs are set in this case, because only two &’s appear in the sample 
line. (A sample line might as well end with &, because the text following the 
last tab isn’t used for anything.) 

The first line of a table can’t always be used as a sample line, because it 
won’t necessarily give the correct tab positions. In a large table you have to look 
ahead and figure out the biggest entry in each column; the sample line is then 
constructed by typing the widest first column, the widest second column, etc., 
omitting the last column. Be sure to include some extra space between columns 
in the sample line, so that the columns won’t touch each other. 

► EXERCISE 22.1 

Explain how to typeset the following table [from Beck, Bertholle, and Child, 
Mastering the Art of French Cooking (New York: Knopf, 1961)]: 


Weight 

Servings 

Approximate Cooking Time* 

8 lbs. 

6 

1 hour and 50 to 55 minutes 

9 lbs. 

7 to 8 

About 2 hours 

9!/o lbs. 

8 to 9 

2 hours and 10 to 15 minutes 

IO 1/2 lbs. 

9 to 10 

2 hours and 15 to 20 minutes 


* For a stuffed goose, add 20 to 40 minutes to the times given. 



If you want to put something flush right in its column, just type ‘\hfilT 
before it; and be sure to type ‘&’ after it, so that l^X will be sure to move the 


information all the way until it touches the next tab. Similarly, if you want to center 


something in its column, type ‘\hfilT before it and ‘\hfillft’ after it. For example, 


\settabs 2 \columns 

\+\hfill This material is set flush right& 

\hfill This material is centered\hf ill&\cr 
\+\hfill in the first half of the line. ft 

\hfill in the second half of the line . \hf ill&\cr 


produces the following little table: 

This material is set flush right This material is centered 

in the first half of the line. in the second half of the line. 

The \+ macro in Appendix B works by putting the (text) for each column 
that’s followed by ft into an libox as follows: 



\hbox to (column width){(text)\hss} 


The \hss means that the text is normally flush left, and that it can extend to the right 
of its box. Since \hfill is “more infinite” than \hss in its ability to stretch, it has the 
effect of right-justifying or centering as stated above. Note that \hfill doesn’t shrink, 
but \hss does; if the text doesn’t fit in its column, it will stick out at the right. You 
could cancel the shrinkability of \hss by adding \hf ilneg; then an oversize text would 
produce an overfull box. You could also center some text by putting ‘\hss’ before it and 


indention, see indentation 
quad 

Beck, Simone 

Bertholle, Louisette 

Child, Julia 

flush right 

hfill 

center 

hss 

hfilneg 
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just ‘ft’ after it; in that case the text would be allowed to extend to the left and right of 
its column. The last column of a \+ line (i.e. , the column entry that is followed by \cr) 
is treated differently: The (text) is simply put into an hbox with its natural width. 


f Computer programs present difficulties of a different kind, since some people 
like to adopt a style in which the tab positions change from line to line. For 
example, consider the following program fragment: 


if n < r then n := n + 1 

else begin print-totals; n := 0; 

end; 

while p > 0 do 

begin q := link(p ); free-node(p); p := q; 

end; 


Special tabs have been set up so that ‘then’ and ‘else’ appear one above the other, 
and so do ‘begin’ and ‘end’, ft’s possible to achieve this by setting up a new sample 
line whenever a new tab position is needed; but that’s a tedious job, so plain TpX 
makes it a little simpler. Whenever you type ft to the right of all existing tabs, the 
effect is to set a new tab there, in such a way that the column just completed will have 
its natural width. Furthermore, there’s an operation ‘\cleartabs’ that resets all tab 
positions to the right of the current column. Therefore the computer program above 
can be IjrXified as follows: 



$$\vbox{\+\bf if $n<r$ \cleartabsft\bf then $n:=n+l$\cr 

\+ft\bf else &{\bf begin} ${\it print\_totals}$ ; $n:=0$;\cr 
\+ftft{\bf end};\cr 

(The remaining part is left as an exercise)}$$ 

► EXERCISE 22.2 

Complete the example computer program by specifying three more \+ lines. 

Although \+ lines can be used in vertical boxes, you must never use \+ inside 
of another \+ line. The \+ macro is intended for simple applications only. 


The \+ and \settabs macros of Appendix B keep track of tabs by maintaining 
JL JL register \box\tabs as a box full of empty boxes whose widths are the column 
widths in reverse order. Thus you can examine the tabs that are currently set, by 
saying ‘\showbox\tabs’; this puts the column widths into your log file, from right to 
left. For example, after ‘\settabs\+\hskipl00pt&\hskip200ptft\cr\showbox\tabs’, 
TjgX will show the lines 


\hbox (0 . 0+0 . 0)x300. 0 
. \hbox(0. 0+0. 0)x200 . 0 
. \hbox(0. 0+0. 0)xl00 . 0 


/>)/>>► EXERCISE 22.3 

JL JL Study the \+ macro in Appendix B and figure out how to change it so that 
tabs work as they do on a mechanical typewriter (i.e., so that ‘&’ always moves to 
the next tab that lies strictly to the right of the current position). Assume that 
the user doesn’t backspace past previous tab positions; for example, if the input is 
‘\+ftft\hskip-2emftx\cr’, do not bother to put ‘x’ in the first or second column, just 
put it at the beginning of the third column. (This exercise is a bit difficult.) 


Computer programs 

cleartabs 

tabs 

showbox 
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TjyX has another important way to make tables, using an operation called 
\halign (“horizontal alignment”). In this case the table format is based on 
the notion of a template, not on tabbing; the idea is to specify a separate environment 
for the text in each column. Individual entries are inserted into their templates, and 
presto, the table is complete. 



f For example, let’s go back to the Horizontal/ Vertical/Matli list example that 
appeared earlier in this chapter; we can specify it with \halign instead of 
with tabs. The new specification is 


\halign{\ indent #\hf il&\quad#\hf il\cr 
Horizontal lists&Chapter 14\cr 
Vertical lists&Chapter 15\cr 
Math lists&Chapter 17\cr} 


and it produces exactly the same result as the old one. This example deserves careful 
study, because \halign is really quite simple once you get the hang of it. The first 
line contains the preamble to the alignment, which is something like the sample line 
used to set tabs for \+. In this case the preamble contains two templates, namely 
‘\indent#\hf il 1 for the first column and ‘\quad#\hf il’ for the second. Each template 
contains exactly one appearance of *#’, and it means “stick the text of each column entry 
in this place.” Thus, the first column of the line that follows the preamble becomes 


\indent Horizontal lists\hfil 


when ‘Horizontal lists’ is stuffed into its template; and the second column, similarly, 
becomes ‘\quad Chapter 14\hfil’. The question is, why \hfil? Ah, now we get to the 
interesting point of the whole thing: Tj?X reads an entire \halign{. . .} specification 
into its memory before typesetting anything, and it keeps track of the maximum width 
of each column, assuming that each column is set without stretching or shrinking the 
glue. Then it goes back and puts every entry into a box, setting the glue so that each 
box has the maximum column width. That’s where the \hfil conies in; it stretches to 
fill up the extra space in narrower entries. 

►EXERCISE 22.4 

JL What table would have resulted if the template for the first column in this 
example had been ‘\indent\hf il#’ instead of ‘\indent#\hf il’ ? 


f Before reading further, please make sure that you understand the idea of tem- 
plates in the example just presented. There are several important differences 
between \halign and \+: (1) \halign calculates the maximum column widths auto- 
matically; you don’t have to guess what the longest entries will be, as you do when 
you set tabs with a sample line. (2) Each \halign does its own calculation of column 
widths; you have to do something special if you want two different \halign operations 
to produce identical alignments. By contrast, the \+ operation remembers tab positions 
until they are specifically reset; any number of paragraphs and even \halign operations 
can intervene between \+’s, without affecting the tabs. (3) Because \halign reads an 
entire table in order to determine the maximum column widths, it is unsuitable for 
huge tables that fill several pages of a book. By contrast, the \+ operation deals with 
one line at a time, so it places no special demands on IjjX’s memory. (However, if 
you have a huge table, you should probably define your own special-purpose macro 
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for each line instead of relying on the general \+ operation.) (4) \halign takes less 
computer time than \+ does, because \halign is a built-in command of Tj^X, while 
\+ is a macro that has been coded in terms of \halign and various other primitive 
operations. (5) Templates are much more versatile than tabs, and they can save you a 
lot of typing. For example, the Horizontal/ Vertical/Math list table could be specified 
more briefly by noticing that there’s common information in the columns: 

\halign{\indent# listsXhf il&\quad Chapter #\cr 
Horizontalftl4\cr Verticalftl5\cr Mathftl7\cr} 


You could even save two more keystrokes by noting that the chapter numbers all start 
with ‘I’ ! (Caution: It takes more time to think of optimizations like this than to type 
things in a straightforward way; do it only if you’re bored and need something amusing 
to keep up your interest.) (6) On the other hand, templates are no substitute for tabs 
when the tab positions are continually varying, as in the computer program example. 



Let’s do a more interesting table, to get more experience with \halign. Here 
is another example based on the Beck/Bertholle/Child book cited earlier: 


American 

French 

Age 

Weight 

Cooking 

Chicken 

Connection 

(months) 

(lbs.) 

Methods 

Squab 

Poussin 

2 

s/ 4 to 1 

Broil, Grill, Roast 

Broiler 

Poulet Nouveau 

2 to 3 

iy 2 to 21/2 

Broil, Grill, Roast 

Fryer 

Poulet Reine 

3 to 5 

2 to 3 

Fry, Saute, Roast 

Roaster 

Poularde 

5y 2 to 9 

Over 3 

Roast, Poach, Fricassee 

Fowl 

Poule de I’Annee 

10 to 12 

Over 3 

Stew, Fricassee 

Rooster 

Coq 

Over 12 

Over 3 

Soup stock, Forcemeat 


Note that, except for the title lines, the first column is set right-justified in boldface 
type; the middle columns are centered; the second column is centered and in italics; 
the final column is left-justified. We would like to be able to type the rows of the table 
as simply as possible; hence, for example, it would be nice to be able to specify the 
bottom row by typing only 

Rooster&Coq&Over 12&0ver 3&Soup stock, ForcemeatXcr 

without worrying about type styles, centering, and so on. This not only cuts down on 
keystrokes, it also reduces the chances for making typographical errors. Therefore the 
template for the first column should be ‘\hf il\bf#’; for the second column it should be 
‘\hf il\it#\hfil’ to get the text centered and italicized; and so on. We also need to al- 
low for space between the columns, say one quad. Voila! La typographic est sur la table: 


\halign{\hf il\bf#ft\quad\hf il\it#\hf il&\quad\hf il#\hf ilft 
\quad\hf il#\hf ilft\quad#\hf il\cr 
(the title lines) 

SquabftPoussin&2ft\frac3/4 to l&Broil, Grill, RoastXcr 
. . . ForcemeatXcr} 


Beck 

Bertholle 

Child 


As with the \+ operation, spaces are ignored after &, in the preamble as well as in the 
individual rows of the table. Thus, it is convenient to end a long row with ‘ft’ when 
the row takes up more than one line in your input file. 
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► EXERCISE 22.5 

How was the ‘Fowl’ line typed? (This is too easy.) 


f Tlie only remaining problem in this example is to specify the title lines, which 
have a different format from the others. In this case the style is different only 
because the typeface is slanted, so there’s no special difficulty; we just type 


\sl American&Xsl French&Xsl Age&\sl Weight&Xsl CookingXcr 
\sl Chicken&Xsl Connection&Xsl (months)&\sl (lbs . )&\sl MethodsXcr 


It is necessary to say ‘\sl’ each time, because each individual entry of a table is 
implicitly enclosed in braces. 

f Tlie author used ‘\openup2pt’ to increase the distance between baselines in 
the poultry table; a discriminating reader will notice that there’s also a bit of 
extra space between the title line and the other lines. This extra space was inserted by 
typing ‘\noalign{\smallskip}’ just after the title line. In general, you can say 

Xnoalignf (vertical mode material)} 


just after any \cr in an Xhalign; TpX will simply copy the vertical mode material, 
without subjecting it to alignment, and it will appear in place when the Xhalign is 
finished. You can use Xnoalign to insert extra space, as here, or to insert penalties 
that affect page breaking, or even to insert lines of text (see Chapter 19). Definitions 
inside the braces of \noalign{. . .} are local to that group. 

f Tlie Xhalign command also makes it possible for you to adjust the spacing 
between columns so that a table will fill a specified area. You don’t have to 
decide that the inter-column space is a quad; you can let IjgX make the decisions, 
based on how wide the columns come out, because TjgX puts “tabskip glue” between 
columns. This tabskip glue is usually zero, but you can set it to any value you like by 
saying ‘\tabskip=(glue) ’. For example, let’s do the poultry table again, but with the 
beginning of the specification changed as follows: 

\tabskip=lem plus2em minus. 5em 

Xhalign to\hsize{\hf il\bf #&\hf il\it#\hf ilftXhf il#\hf il& 

\hf il#\hf il&#\hf il\cr 


The main body of the table is unchanged, but the Xquad spaces have been removed 
from the preamble, and a nonzero Xtabskip has been specified instead. Furthermore 
‘Xhalign’ has been changed to ‘Xhalign toXhsize’; this means that each line of the 
table will be put into a box whose width is the current value of Xhsize, i.e., the 
horizontal line width usually used in paragraphs. The resulting table looks like this: 


American 

French 

Age 

Weight 

Cooking 

Chicken 

Connection 

(months) 

(lbs.) 

Methods 

Squab 

Poussin 

2 

3/j to 1 

Broil, Grill, Roast 

Broiler 

Poulet Nouveau 

2 to 3 

IV 2 to 272 

Broil, Grill, Roast 

Fryer 

Poulet Reine 

3 to 5 

2 to 3 

Fry, Saute, Roast 

Roaster 

Poularde 

5 72 to 9 

Over 3 

Roast, Poach, Fricassee 

Fowl 

Poule de I’Annee 

10 to 12 

Over 3 

Stew, Fricassee 

Rooster 

Coq 

Over 12 

Over 3 

Soup stock, Forcemeat 


openup 
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inter-column space 
tabskip glue 
tabskip 
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f in general, Tj^X puts tabskip glue before the first, column, after the last column, 
and between the columns of an alignment. You can specify the final aligned 
size by saying ‘\halign to(dimen)’ or ‘\halign spread{dimen}’, just as you can say 
‘\hbox to(dimen)’ and ‘\hbox spread(dimen)’. This specification governs the setting 
of the tabskip glue; but it does not affect the setting of the glue within column entries. 
(Those entries have already been packaged into boxes having the maximum natural 
width for their columns, as described earlier. ) 

/gN/gN Therefore ‘\halign to \hsize’ will do nothing if the tabskip glue lias no 
JL JL stretchability or shrinkability, except that it will cause TjrjX to report an 
underfull or overfull box. An overfull box occurs if the tabskip glue can’t shrink to 
meet the given specification; in this case you get a warning on the terminal and in your 
log file, but there is no “overfull rule” to mark the oversize table on the printed output. 
The warning message shows a “prototype row” (see Chapter 27). 

f The poultry example just given used the same tabskip glue everywhere, but 
you can vary it by resetting \tabskip within the preamble. The tabskip glue 
that is in force when TjrjX reads the following \halign will be used before the first 
column; the tabskip glue that is in force when TpjX reads the after the first template 
will be used between the first and second columns; and so on. The tabskip glue that 
is in force when TjgX reads the \cr after the last template will be used after the last 
column. For example, in 

\tabskip=3pt 

\halign{\hf il#\tabskip=4pt& #\hf il& 

\hbox to 10em{\hss\tabskip=5pt # \hss}\cr ...} 

the preamble specifies aligned lines that will consist of the following seven parts: 


tabskip glue 3 pt; 

first column, with template ‘\hfil#’; 
tabskip glue 4 pt; 

second column, with template ‘#\hf il’; 
tabskip glue 4 pt; 

third column, with template ‘\hbox to 10em{\hss# \hss}’; 
tabskip glue 5 pt. 

/gK/gK TgX copies the templates without interpreting them except to remove any 
JL JL \tabskip glue specifications. More precisely, the tokens of the preamble are 
passed directly to the templates without macro expansion; T|’X looks only for ‘\cr’ 
commands, ‘St 1 , “#’, ‘\span’, and ‘\tabskip’. The (glue) following ‘\tabskip’ is scanned 
in the usual way (with macro expansion), and the corresponding tokens are not in- 
cluded in the current template. Notice that, in the example above, the space after 
‘5pt’ also disappeared. The fact that \tabskip=5pt occurred inside an extra level of 
braces did not make the definition local, since TgX didn’t “see” those braces; similarly, 
if \tabskip had been preceded by ‘\global’, TjjX wouldn’t have made a global defini- 
tion, it would just have put ‘\global’ into the template. All assignments to \tabskip 
within the preamble are local to the \halign (unless \globaldefs is positive), so the 
value of \tabskip will be 3pt again when this particular \halign is completed. 



When ‘\span’ appears in a preamble, it causes the next token to be expanded, 
i.e., “ex-span-ded,” before Tj^X reads on. 


to 

spread 
underfull 
overfull 
overfull rule 
prototype row 
tabskip 
globaldefs 
span 



Chapter 22: Alignment 239 



► EXERCISE 22.6 

Design a preamble for the following table: 


England P. Philips 1560-1628 Netherlands J. P. Sweelinck 1562-1621 

J. Bull cl563-1628 P. Cornet cl570-1633 

Germany H. L. Hassler 1562-1612 Italy G. Frescobaldi 1583-1643 

M. Praetorius 1571-1621 Spain F. Correa de Arauxo cl576-1654 

France J. Titelouze 1563-1633 Portugal M. R. Coelho cl555-el635 


The tabskip glue should be zero at the left and right of each line; it should be 1 em 
plus 2em in the center; and it should be .5em plus .5em before the names, Oem plus 
.5em before the dates. Assume that the lines of the table will be specified by, e.g., 


FranceftJ. Titelouzeftl563 — 1633ft 

Portugal&M. E. Coelho&\\1555— \\1635\cr 


where l \V has been predefined by ‘\def\\{{\it c\/}}’. 



► EXERCISE 22.7 

Design a preamble so that the table 


rydw i = I am 
rwyt ti = thou art 
mae e = he is 
mae hi = she is 
rydyn ni = we are 
rydych chi = you are 
maen nhw = they are 


ydw i = am I 
wyt ti = art thou 
ydy e = is he 
ydy hi = is she 
ydyn ni = are we 
ydych chi = are you 
ydyn nhw = are they 


can be specified by typing lines like 


roeddwn i = I was 
roeddet ti = thou wast 
roedd e = he was 
roedd hi = she was 
roedden ni = we were 
roeddech chi = you were 
roedden nhw = they were 


mae hi=she is&ydy hi=is sheftroedd hi=she was\cr 


organists 

Cornet, Peeter 

Philips, Peter 

Sweelinck, Jan Pieterszoon 

Bull, John 

Titelouze, Jehan 

Hassler, Hans Leo 

Prsetorius [Schultheiss], Michael 

Frescobaldi, Girolamo 

Coelho, Manuel Rodrigues 

Correa de Arauxo, Francisco 

Welsh conjugation 

Durant 

Aristippus of Cyrene 

Antisthenes of Athens 

Plato 

Xenophon 

Aristophanes 

Dionysius I of Syracuse 

Isocrates 

Evagoras of Salamis 
Archytas of Taras 



► EXERCISE 22.8 

The line breaks in the second 


column of the table at the right were 
chosen by T^X so that the second col- 
umn was exactly 16 ems wide. Fur- 
thermore, the author specified one of 
the rows of the table by typing 


\\393&Plato ’ s {\sl ApologyX/}; 
Xenophon’s 
{\sl MemorabiliaX/} ; 
Aristophanes ’ 

{\sl Ecclesiazus\ae\/}\cr 


Can you guess what preamble was used 
in the alignment? [The data comes 
from Will Durant’s The Life of Greece 
(Simon & Schuster, 1939).] 


B.C. 

397: War between Syracuse and Carthage 
396: Aristippus of Cyrene and Antisthe- 
nes of Athens (philosophers) 

395: Athens rebuilds the Long Walls 
394: Battles of Coronea and Cnidus 
c393: Plato’s Apology; Xenophon’s Memo- 
rabilia; Aristophanes’ Ecclesiazusae 
391-87: Dionysius subjugates south Italy 
391: Isocrates opens his school 
390: Evagoras Hellenizes Cyprus 
387: “King’s Peace’’; Plato visits Archy- 
tas of Taras (mathematician) and 
Dionysius I 

386: Plato founds the Academy 

383: Spartans occupy Cadmeia at Thebes 

380: Isocrates’ Panegyricus 
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f Sometimes a template will apply perfectly to all but one or two of the entries 
in a column. For example, in the exercise just given, the colons in the first 
column of the alignment were supplied by the template ‘\hfil#: u ’; but the very first 
entry in that column, ‘B.C.’, did not have a colon. T^X allows you to escape from the 
stated template in the following way: If the very first token of an alignment entry is 
‘\omit’ (after macro expansion), then the template of the preamble is omitted; the 
trivial template ‘#’ is used instead. For example, ‘B.C.’ was put into the table above 
by typing ‘\omit\hf il\sevenrm B.C. 1 immediately after the preamble. You can use 
\omit in any column, but it must come first; otherwise TjrX will insert the template 
that was defined in the preamble. 

/gb/gb If you think about what IjjX has to do when it’s processing \halign, you’ll 
JL JL realize that the timing of certain actions is critical. Macros are not expanded 
when the preamble is being read, except as described earlier; but once the \cr at the 
end of the preamble has been sensed, TjjX must look ahead to see if the next token is 
\noalign or \omit, and macros are expanded until the next non-space token is found. 
If the token doesn’t turn out to be \noalign or \omit, it is put back to be read again, 
and Tj?X begins to read the template (still expanding macros). The template has two 
parts, called the u and v parts, where u precedes the ‘#’ and v follows it. When TpX 
has finished the u part, its reading mechanism goes back to the token that was neither 
\noalign nor \omit, and continues to read the entry until getting to the ft or \cr that 
ends the entry; then the v part of the template is read. A special internal operation 
called \endtemplate is always placed at the end of the v part; this causes Tj?X to put 
the entry into an “unset box” whose glue will be set later when the final column width 
is known. Then T^X is ready for another entry; it looks ahead for \omit (and also for 
\noalign, after \cr) and the process continues in the same way. 

(&)&) One consequence of the process just described is that it may be dangerous 
JL JL to begin an entry of an alignment with \if . . ., or with any macro that will 
expand into a replacement text whose first token is \if . . .; the reason is that the 
condition will be evaluated before the template has been read. (Tj?X is still looking 
to see whether an \omit will occur, when the \if is being expanded.) For example, if 
\strut has been defined to be an abbreviation for 


\ifmmode(text for math modes)\else{text for nonmath modes)\fi 

and if \strut appears as the first token in some alignment entry, then Tj?X will expand 
it into the (text for nonmatli modes) even though the template might be *$#$’, because 
TjgX will not yet be in math mode when it is looking for a possible \omit. Chaos will 
probably ensue. Therefore the replacement text for \ strut in Appendix B is actually 

\relax\ifmmode . . . 


and ‘\relax 1 has also been put into all other macros that might suffer from such timing 
problems. Sometimes you do want TjrX to expand a conditional before a template is 
inserted, but careful macro designers watch out for cases where this could cause trouble. 

f When you’re typesetting numerical tables, it’s common practice to line up the 
decimal points in a column. For example, if two numbers like ‘0.2010’ and 
‘297.1’ both appear in the same column, you’re supposed to produce ‘ 297 i° 10 ’• This 
result isn’t especially pleasing to the eye, but that’s what people do, so you might 


omit 

endtemplate 
unset box 
conditionals 
strut 
ifmmode 
numerical tables 
decimal points 
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have to conform to the practice. One way to handle this is to treat the column as two 
columns, somewhat as \eqalign treats one formula as two formulas; the can be 
placed at the beginning of the second half-column. But the author usually prefers to 
use another, less sophisticated method, which takes advantage of the fact that the digits 
0, 1, . . . , 9 have the same width in most fonts: You can choose a character that’s not 
used elsewhere in the table, say “?’, and change it to an active character that produces 
a blank space exactly equal to the width of a digit. Then it’s usually no chore to put 
such nulls into the table entries so that each column can be regarded as either centered 
or right-justified or left-justified. For example, ‘??0.2010’ and ‘297.1???’ have the 
same width, so their decimal points will line up easily. Here is one way to set up “?’ 
for this purpose: 

\newdimen\digitwidth 

\setboxO=\hbox{\rmO} 

\digitwidth=\wdO 
\catcode ‘ ?=\active 
\def?{\kern\digitwidth} 

The last two definitions should be local to some group, e.g., inside a \vbox, so that “?’ 
will resume its normal behavior when the table is finished. 



Let’s look now at some applications to mathematics. Suppose first that you 
want to typeset the small table 

n = 0123456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 ... 
g(n) = 1 2 4 3 6 7 8 16 18 25 32 11 64 31 128 10 256 5 512 28 1024 . . . 


as a displayed equation. A brute force approach using \eqalign or \atop is cumbersome 
because Q{n) and n don’t always have the same number of digits. It would be much 
nicer to type 


$$\vbox{\halign{ (preamble) \cr 

n\phantom)&0&l&2&3& . . . &20&\dots\cr 
{\cal G} (n)&l&2&4&3ft ... &1024&\dots\cr}}$$ 


for some (preamble). On the other hand, the (preamble) is sure to be long, since 
this table has 23 columns; so it looks as though \settabs and \+ will be easier. TpX 
has a handy feature that helps a lot in cases like this: Preambles often have a periodic 
structure, and if you put an extra ‘ft’ just before one of the templates, TjrX will consider 
the preamble to be an infinite sequence that begins again at the marked template when 
the \cr is reached. For example, 

f i ft t-2 & t.3 &ft t.4 ft to \cr is treated like t.i St t.2 ft t.3 ft 1 . 4 ft t - 0 ft t.4 ft t 5 ft t.4 & 

and 

&fi &t-2 ft t.3 ft &fs \cr is treated like fi & t.2 ft t3 & t.4 & fs & ti & <2 ft t.3 ft • • • . 


The tabskip glue following each template is copied with that template. The preamble 
will grow as long as needed, based on the number of columns actually used by the 
subsequent alignment entries. Therefore all it takes is 


active character 

group 

active 

display 

periodic preambles 
cyclic preambles 
ampersand ampersand 
ampersand 


$\hfil#$ =&&\ \hfil#\hfil\cr 
to make a suitable (preamble) for the Q{n) problem. 
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Now suppose that the task is to typeset three pairs of displayed formulas, with 
all of the = signs lined up: 


Vi = Vi —qn’j, Xj = x-i — qiXj , Ui = «», for i ^ j; 
Vj = vj, Xj = xj, Uj = Uj 4- QiUi- 


(23) 


It’s not easy to do this with three Xeqalign’s, because the ^ with a subscript H ^ j' 
makes the right-hand pair of formulas bigger than the others; the baselines won’t agree 
unless “phantoms” are put into the other two Xeqalign’s (see Chapter 19). Instead of 
using Xeqalign, which is defined in Appendix B to be a macro that uses Xhalign, let’s 
try to use Xhalign directly. The natural way to approach this display is to type 


$$\vcenter{\openupl\jot \halign{ (preamble) \cr 
(first line)\cr (second line)\cr}}\eqno (23) $$ 


because the Xvcenter puts the lines into a box that is properly centered with respect 
to the equation number ‘(23)’; the Xopenup macro puts a bit of extra space between 
the lines, as mentioned in Chapter 19. 


OK, now let’s figure out how to type the (first line) and (second line). The 
JL JL usual convention is to put before the symbols that we want to line up, so 
the obvious solution is to type 


V_i&=v_i-q_iv_j ,&X_i&=x_i-q_ix_j ,& 

U_i&=u_i , \qquad\hbox{for $i\ne j$};\cr 
V_j&=v_j ,&X_j&=x_j , & 

U_j&=u_ j+\sum_{i\ne j}q_iu_i.\cr 


Thus the alignment has six columns. We could take common elements into the preamble 
(e.g., ‘VJ and ‘=v_’), but that would be too error-prone and too tricky. 

The remaining problem is to construct a preamble to support those lines. To 
JL JL the left of the = signs we want the column to be filled at the left; to the right 
of the = signs we want it to be filled at the right. There’s a slight complication because 
we are breaking a math formula into two separate pieces, yet we want the result to 
have the same spacing as if it were one formula. Since we’re putting the ‘&’ just before 
a relation, the solution is to insert ‘O’ at the beginning of the right-hand formula; TpX 
will put the proper space before the equals sign in ‘${}=. . but it puts no space 
before the equals sign in ‘$=. . .$’. Therefore the desired (preamble) is 

$\hf il#$&$0#\hf il$& 

\qquad$\hfil#$&$0#\hfil$& 

\qquad$\hf il#$&$0#\hf il$ 


The third and fourth columns are like the first and second, except for the Xqquad that 
separates the equations; the fifth and sixth columns are like the third and fourth. Once 
again we can use the handy ‘&&’ shortcut to reduce the preamble to 


$\hfil#$&&$0#\hf il$&\qquad$\hf il#$ 


eqalign 

phantoms 

jot 

vcenter 
openup 
lbrace rbrace 
ampersand ampersand 


With a little practice you’ll find that it becomes easy to compose preambles as you are 
typing a manuscript that needs them. However, most manuscripts don’t need them, so 
it may be a while before you acquire even a little practice in this regard. 
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/>)► EXERCISE 22.9 

JL Explain how to produce the following display: 

lOw -F 3 a~ + 3 y 4 - 18 * = 1 , 


6 w — 17 x 


- 5z = 2. 


The next level of complexity occurs when some entries of a table span two 
JL JL or more columns. TpX provides two ways to handle this. First there’s 
\hidewidth, which plain TgX defines to be equivalent to 

\hskip-1000pt plus If ill 

In other words, \hidewidth has an extremely negative “natural width,” but it will 
stretch without limit. If you put \hidewidth at the right of some entry in an alignment, 
the effect is to ignore the width of this entry and to let it stick out to the right of its 
box. (Think about it; this entry won’t be the widest one, when \halign figures the 
column width.) Similarly, if you put \hidewidth at the left of an entry, it will stick 
out to the left; and you can put \hidewidth at both left and right, as we’ll see later. 

/gb/gb The second way to handle table entries that span columns is to use the \span 
JL JL primitive, which can be used instead of in any line of the table. (We’ve 
already seen that \span means “expand” in preambles; but outside of preambles its 
use is completely different.) When ‘\span’ appears in place of the material before 
and after the \span is processed in the ordinary way, but afterward it is placed into a 
single box instead of two boxes. The width of this combination box is the sum of the 
individual column widths plus the width of the tabskip glue between them; therefore 
the spanning box will line up with non-spanning boxes in other rows. 

(&)&) For example, suppose that there are three columns, with the respective tem- 
JL JL plates U\t vi & «2 # V2 & U3 # v 3 ; suppose that the column widths are wi,u>2, W3 ; 
suppose that go, gi, gi, gs are the tabskip glue widths after the glue has been set; and 
suppose that the line 

«i\span a.2\span fl3\cr 

has appeared in the alignment. Then the material for l uiaiViU2a2ViU3a,3V3 (i.e., the 
result ‘ui aiVi ’ of column 1 followed by the results of columns 2 and 3 ) will be placed 
into an libox of width wi + gi + W2 + g2 + 1113. That hbox will be preceded by glue of 
width g 0 and it will be followed by glue of width g 3 , in the larger hbox that contains 
the entire aligned line. 

/>)/>) You can use \omit in conjunction with \span. For example, if we continue 
JL JL with the notation of the previous paragraph, the line 

\omit a 1 \span02 \span\omit 03 \cr 

would put the material for ‘aiu^aii^as' into the hbox just considered. 

/>)/>) It’s fairly common to span several columns and to omit all their templates, 
JL JL so plain IjgX provides a \multispan macro that spans a given number of 
columns. For example, ‘\imiltispan3’ expands into ‘\omit\span\omit\span\omit’. If 
the number of spanned columns is greater than 9 , you must put it in braces, e.g., 
‘\niultispan{ 13 }’. 


spanned columns in tables 

hidewidth 

span 

omit 

multispan 
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The preceding paragraphs are rather abstract, so let’s look at an example that 
shows what \span actually does. Suppose you type 


$$\tabskip=3em 

\vbox{\halign{&\hrulef ill#\hrulef ill\cr 
f irst&secondftthird\cr 
f irst-and-second\span\omit&\cr 
&second-and-third\span\omit\cr 
f irst-second-third\span\omit\span\omit\cr}}$$ 


The preamble specifies arbitrarily many templates equal to ‘\hrulef ill#\hrulef ill’; 
the \hrulefill macro is like \hfill except that the blank space is filled with a hor- 
izontal rule. Therefore you can see the filling in the resulting alignment, which shows 
the spanned columns: 

first second third 

_first-and-second_ 

_second-and-third_ 

first-second-third 


The rules stop where the tabskip glue separates columns. You don’t see rules in the 
first line, since the entries in that line were the widest in their columns. However, if 
the tabskip glue had been 1 ern instead of 3 em, the table would have looked like this: 

first _second_ third 

first-and-second 

second-and-third 

first-second-third 

EXERCISE 22.10 

JL JL Consider the following table, which is called Walter’s worksheet: 

1 Adjusted gross income $4,000 

2 Zero bracket amount for 

a single individual $2,300 

3 Earned income 1,500 

4 Subtract line 3 from line 2 800 

5 Add lines 1 and 4. Enter here 

and on Form 1040, line 35 $4,800 

Define a preamble so that the following specification will produce Walter’s worksheet. 
\halign{ (preamble) \cr 

l&Adjusted gross income\dotf ill\span\omit\span&\$4 , 000\cr 
2&Zero bracket amount for&\cr 
&a single individual\dotf ill\span\omit&\$2 ,300\cr 
3&Earned income\dotf ill\span\omit&\underbar{ l,500}\cr 
4&Subtract line 3 from line 2\dotfill 
\span\omit\span&\underbar{ 800}\cr 
5&Add lines 1 and 4. Enter here\span\omit\span\cr 
&and on Form 1040, line 35\dotfill\span\omit\span&\$4,800\cr} 

(The macro \dotfill is like \hrulefill but it fills with dots; the macro \underbar 
puts its argument into an hbox and underlines it.) 


hrulefill 

Walters worksheet 
IRS 

Green. Walter 

dotfill 

underbar 
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/gK/gK Notice the “early” appearance of \cr in line 2 of the previous exercise. You 
JL JL needn’t have the same number of columns in every line of an alignment; ‘\cr’ 
means that there are no more columns in the current line. 


/g ,/g> EXERCISE 22.11 

JL JL Explain how to typeset the generic matrix 


a 11 

a 12 

. . Clin 

021 

022 

(l 2 n 

CL ml 

O’ m 2 

• • CLmn 


/gL/gL The presence of spanned columns adds a complication to TjgX’s rules for calcu- 
JL JL lating column widths; instead of simply choosing the maximum natural width 
of the column entries, it’s also necessary to make sure that the sum of certain widths 
is big enough to accommodate spanned entries. So here is what TjrX actually does: 
First, if any pair of adjacent columns is always spanned as a unit (i.e., if there’s a \span 
between them whenever either one is used), these two columns are effectively merged 
into one and the tabskip glue between them is set to zero. This reduces the problem to 
the case that every tab position actually occurs at a boundary. Let there be n columns 
remaining after such reductions, and for 1 < i < j < n let wy be the maximum nat- 
ural width of all entries that span columns i through j, inclusive; if there are no such 
spanned entries, let wy = — oo. (The merging of dependent columns guarantees that, 
for each j, there exists i < j such that toy > — oo.) Let f* be the natural width of the 
tabskip glue between columns k and k + 1 , for 1 < k < n. Now the final width wj of 
column j is determined by the formula 


= max (wy - J2i 

l<i<3 


• ■ (Wk +tk)) 


for j = 1, 2, . . . , n (in this order). It follows that toy < to,- + !,: + ••• +tj-i +Wj, for all 
i < j, as desired. After the widths wj are determined, the tabskip amounts may have 
to stretch or shrink; if they shrink, toy might turn out to be more than the final width 
of a box that spans columns i through j, hence the glue in such a box might shrink. 

/gL/gL These formulas usually work fine, but sometimes they produce undesirable 
JL JL effects. For example, suppose that n = 3, ton = W22 = w 33 = 10, w 12 = 
W23 = —00, and w 13 = 100; in other words, the columns by themselves are quite 
narrow, but there’s a big wide entry that’s supposed to span all three columns. In this 
case TjgX’s formula makes toi = 102 = 10 but to 3 = 80 — fi — ( 2 , so all the excess width 
is allocated to the third column. If that’s not what you want, the remedy is to use 
\hidewidth, or to increase the natural width of the tabskip glue between columns. 

/gL/gL The next level of complexity that occurs in tables is the appearance of lior- 
JL JL izont.al and vertical ruled lines. People who know how to make ruled tables 
are generally known as TjjX Masters. Are you ready? 

/gL/gL If you approach vertical rules in the wrong manner, they can be difficult; but 
JL JL there is a decent way to get them into tables without shedding too many tears. 
The first step is to say ‘\off interlineskip’, which means that there will be no blank 
space between lines; TgX cannot be allowed to insert interline glue in its normal clever 
way, because each line is supposed to contain a \vrule that abuts another \vrule in the 
neighboring lines above and/or below. We will put a strut into every line, by including 
one in the preamble; then each line will have the proper height and depth, and there 
will be no need for interline glue. TjgX puts every column entry of an alignment into 


cr 

generic matrix 
hidewidth 
ruled tables 
TeX Masters 
offinterlineskip 
interline glue 
vrule 
vrule 
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an libox whose height and depth are set equal to the height and depth of the entire 
line; therefore \vrule commands will extend to the top and bottom of the lines even 
when their height and/or depth are not specified. 


/gb/gb A “column” should be allocated to every vertical rule, and such a column can 
JL JL be assigned the template ‘\vrule#’. Then you obtain a vertical rule by simply 
leaving the column entries blank, in the normal lines of the alignment; or you can say 
‘\omit’ if you want to omit the rule in some line; or you can say ‘height 10pt’ if you 
want a nonstandard height; and so on. 



Here is a small table that illustrates the points just made. [The data appeared 
in an article by A. H. Westing, BioScience 31 (1981), 523-524.] 


\vbox{\off interline skip 
\hrule 

\halign{&\vrule#& 

\ strut\quad\hf il#\quad\ cr 
height2pt&\omitM\omit&\cr 
&Year\hf ilft&World Population&Xcr 
height2pt&\omit&&\omit&\cr 
\noalign{\hrule} 
height2pt&\omit&&\omit&\cr 
&8000XBC&&5 , 000 , OOO&Xcr 
&50XAD&&200 , 000 , OOO&Xcr 
&1650XAD&&500 , 000 , OOO&Xcr 
&1850XAD&&1 , 000 , 000 , OOO&Xcr 
&1945XAD&&2 , 300 , 000 , 000&\cr 
&1980XAD&&4 ,400 , 000 , 000&\cr 
height2pt&\omit&&\omit&\cr} 
Xhrule} 


Year 

World Population 

8000 

B.C. 

5,000,000 

50 

A.D. 

200,000,000 

1650 

A.D. 

500,000,000 

1850 

A.D. 

1,000,000,000 

1945 

A.D. 

2,300,000,000 

1980 

A.D. 

4,400,000,000 


Westing 

hrule 

noalign 

multispan 

strut 


In this example the first, third, and fifth columns are reserved for vertical rules. Hori- 
zontal rules are obtained by saying ‘Xhrule’ outside the Xhalign or ‘\noalign{\hrule}’ 
inside it, because the Xhalign appears in a vbox whose width is the full table width. 
The horizontal rules could also have been specified by saying ‘\multispan5\hrulef ill’ 
inside the Xhalign, since that would produce a rule that spans all five columns. 

/gb/gb The only other nonobvious thing about this table is the inclusion of several 
JL JL lines that say ‘height2pt&\omit&&\omit&\cr’; can you see what they do? The 
\omit instructions mean that there’s no numerical information, and they also suppress 
the Xstrut from the line; the ‘height2pt’ makes the first Xvrule 2pt high, and the 
other two rules will follow suit. Thus, the effect is to extend the vertical rules by two 
points, where they touch the horizontal rules. This is a little touch that improves the 
appearance of boxed tables; look for it as a mark of quality. 



EXERCISE 22.12 

Explain why the lines of this table say ‘ftXcr’ instead of just ‘\cr’. 


Another way to get vertical rules into tables is to typeset without them, then 
JL JL back up (using negative glue) and insert them. 




Chapter 22: Alignment 


247 


Here is another table; this one has become a classic, ever since Michael Lesk 
published it as one of the first examples in his report on a program to format 
tables [Bell Laboratories Computing Science Technical Report 49 (1976)]. It illustrates 
several typical problems that arise in connection with boxed information. In order to 
demonstrate TgX’s ability to adapt a table to different circumstances, tabskip glue is 
used here to adjust the column widths; the table appears twice, once generated by 
‘\halign to!25pt’ and once by ‘\halign to200pt’, with nothing else changed. 


Lesk 

AT&T 

hidewidth 

rlap 


| AT&T Common Stock j 

Year 

Price 

Dividend 

1971 

41-54 

$2.60 

2 

41-54 

2.70 

3 

46-55 

2.87 

4 

40-53 

3.24 

5 

45-52 

3.40 

6 

51-59 

.95* 


[ AT&T Common Stock | 

Year 

Price 

Dividend 

1971 

41-54 

$2.60 

2 

41-54 

2.70 

3 

46-55 

2.87 

4 

40-53 

3.24 

5 

45-52 

3.40 

6 

51-59 

.95* 


* (first quarter only) * (first quarter only) 

The following specification did the job: 

\vbox{\tabskip=Opt \off interlineskip 
\def\tablerule{\noalign{\hrule}} 

\halign to(dimen){\strut#& \vrule#\tabskip=lem plus2em& 

\hfil#& \vrule#& \hf il#\hf il& \vrule#& 

\hf il#& \vrule#\tabskip=Opt\cr\tablerule 
&&\multispan5\hf il AT\&T Common Stock\hf il&\cr\tablerule 
&&\omit\hidewidth Year\hidewidth&& 

\omit\hidewidth Price\hidewidth&& 

\omit\hidewidth Dividend\hidewidth&\cr\tablerule 
&&1971&&41 — 54&&\$2 . 60&\cr\tablerule 
&& 2&&41 — 54&&2 . 70&\cr\tablerule 

&& 3&&46 — 55&&2 . 87&\cr\tablerule 

&& 4&&40 — 53&&3. 24&\cr\tablerule 

&& 5&&45 — 52&&3.40&\cr\tablerule 

&& 6&&51 — 59&& . 95\rlap*&\cr\tablerule \noalign{\smallskip} 

&\multispan7* (first quarter only)\hf il\cr}} 

Points of interest are: (1) The first column contains a strut; otherwise it would have 
been necessary to put a strut on the lines that say ‘AT&T’ and ‘(first quarter only)’, 
since those lines omit the templates of all other columns that might have a built-in 
strut. (2) ‘\hidewidth’ is used in the title line so that the width of columns will be 
affected only by the width of the numeric data. (3) ‘\rlap’ is used so that the asterisk 
doesn’t affect the alignment of the numbers. (4) If the tabskip specification had been 
‘Oem plus3em’ instead of ‘lem plus2em’, the alignment wouldn’t have come out right, 
because ‘AT&T Common Stock’ would have been wider than the natural width of 
everything it spanned; the excess width would all have gone into the ‘Dividend’ column. 


''§)(>)*■ EXERCISE 22.13 

JL JL Explain how to add 2 pt more space above and below ‘AT&T Common Stock’. 
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► EXERCISE 22.14 

Typeset the following chart, making it exactly 36em wide: 

J. H. Bohning, 1838 


L. M. Bohning, 1912 


M. J. H. Bohning, 1882 


P. A. M. Ehlert, 1884 


M. D. Blase, 1840 


E. F. Ehlert,, 1845 


C. L. Wischmeyer, 1850 


If you’re having trouble debugging an alignment, it sometimes helps to put 
‘\ddt’ at the beginning and end of the templates in your preamble. This 
is an undefined control sequence that causes TjgX to stop, displaying the rest of the 
template. When T|.X stops, you can use \showlists and other commands to see what 
the machine thinks it’s doing. If TgX doesn’t stop, you know that it never reached that 
part of the template. 

It’s possible to have alignments within alignments. Therefore when TgX sees a 
‘ft’ or ‘\span’ or ‘\cr’, it needs some way to decide which alignment is involved. 
The rule is that an entry ends when “&’ or ‘\span’ or ‘\cr’ occurs at the same level of 
braces that was current when the entry began; i.e., there must be an equal number of 
left and right braces in every entry. For example, in the line 

\matrix{lftl\cr 0&l\cr}ft\matrix{0ftl\cr 0&0\cr}\cr 


TEX will not resume the template for the first column when it is scanning the argument 
to \matrix, because the ft’s and \cr’s in that argument are enclosed in braces. Similarly, 
ft’s and \cr’s in the preamble do not denote the end of a template unless the resulting 
template would have an equal number of left and right braces. 

You have to be careful with the use of & and \span and \cr, because these 
JL JL tokens are intercepted by TEX’s scanner even when it is not expanding macros. 
For example, if you say ‘\let\x=\span’ in the midst of an alignment entry, TEX will 
think that the ‘\span’ ends the entry, so \x will become equal to the first token fol- 
lowing the “#’ in the template. You can hide this \span by putting it in braces; e.g., 
‘{\global\let\x=\span}’. (And Appendix D explains how to avoid \global here.) 


Sometimes people forget the \cr on the last line of an alignment. This can 
cause mysterious effects, because TEX is not clairvoyant. For example, con- 
sider the following apparently simple case: 


\halign{\ center line{#}\cr 
A centered line.\cr 
And another?} 


(Notice the missing \cr.) A curious thing happens here when TEX processes the 
erroneous line, so please pay attention. The template begins with ‘\centerline{’, 
so TEX starts to scan the argument to \centerline. Since there’s no ‘\cr’ after the 
question mark, the “}’ after the question mark is treated as the end of the argument 
to \centerline, not as the end of the \halign. TEX isn’t going to be able to finish 
the alignment unless the subsequent text has the form . .\cr’. Indeed, an entry 


family tree 

Bohning [Knuth], Louise Marie 

Ehlert [Bohning], Pauline Anna P 1 

Bohning, Martin John Henry 

Wischmeyer [Ehlert], Clara Louis 

Ehlert, Ernst Fred 

Blase [Bohning], Maria Dorothea 

Bohning, Jobst Heinrich 

debugging 

ddt 

braces 

span 

cr 

ampersand 
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like ‘a}b{c’ is legitimate with respect to the template ‘\centerline{#}’, since it yields 
‘\centerline{a}b{c}’; Tj-'X is correct when it gives no error message in this case. But 
the computer’s idea of the current situation is different from the user’s, so a puzzling 
error message will probably occur a few lines later. 


/>>/>) To help avoid such situations, there’s a primitive command \crcr that acts 
JL JL exactly like \cr except that it does nothing when it immediately follows a \cr 
or a \noalign{. . Thus, when you write a macro like \matrix, you can safely insert 
\crcr at the end of the user’s argument; this will cover up an error if the user forgot 
the final \cr, and it will cause no harm if the final \cr was present. 



Are you tired of typing \cr? You can get plain TpX to insert an automatic 
\cr at the end of each input line in the following way: 


\begingroup \let\par=\cr \obeylines '/, 
\halign{ (preamble) 

(first line of alignment) 


(last line of alignment) 

}\endgroup 

This works because \obeylines makes the ASCII (return) into an active character that 
uses the current meaning of \par, and plain TjrjX puts (return) at the end of an input 
line (see Chapter 8). If you don’t want a \cr at the end of a certain line, just type 
“/,’ and the corresponding \cr will be “commented out.” (This special mode doesn’t 
work with \+ lines, since \+ is a macro whose argument is delimited by the token ‘\cr’, 
not simply by a token that has the same meaning as \cr. But you can redefine \+ to 
overcome this hurdle, if you want to. For example, define a macro \alternateplus 
that is just like \+ except that its argument is delimited by the active character **M; 
then include the command ‘\let\+=\alternateplus’ as part of \obeylines.) 

f Tlie control sequence \valign is analogous to \halign, but rows and columns 
change roles. In this case \cr marks the bottom of a column, and the aligned 
columns are vboxes that are put together in horizontal mode. The individual entries 
of each column are vboxed with depth zero (i.e., as if \boxmaxdepth were zero, as 
explained in Chapter 12); the entry heights for each row of a \valign are maximized 
in the same fashion as the entry widths for each column of an \halign are maximized. 
The \noalign operation can now be used to insert horizontal mode material between 
columns; the \span operation now spans rows. People usually work with TjrjX at least a 
year before they find their first application for \valign; and then it’s usually a one-row 
‘\valign{\vf il#\vf il\cr . . But the general mechanism is there if you need it. 


If sixteen pennies are arranged in the form of a square 
there will be the same number of pennies in every row, every column, 

and each of the two long diagonals. 
Can you do the same with twenty pennies? 
— HENRY ERNEST DUDENEY, The Best Coin Problems (1909) 
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It was she who controlled the whole of the Fifth Column. 

— AGATHA CHRISTIE, N or M? (1941) 
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We investigated TeX’s page-building technique in Chapter 15, where we dis- 
cussed the basic two-stage strategy that is used: TgX gathers material until it 
has accumulated more than will fit on a page; then it spews out one page of data, 
based on what it thinks is the best breakpoint between pages; then it returns 
to gather material for the next page in the same way. Page numbers, headings, 
and similar things are attached after each page has been ejected, by a special 
sequence of TgX commands called the current output routine. 

Plain TgX has an output routine that takes care of ordinary jobs. It han- 
dles the simple things that most manuscripts require, and it also copes with more 
complicated things like the insertions made with \footnote and \topinsert, 
as described in the dangerous bends of Chapter 15. We shall begin the present 
chapter by discussing how to make simple changes to the behavior of plain TeX’s 
output routine; then we shall turn to the details of how to define output routines 
that do more complex tasks. 

If you run TeX without modifying the plain TeX format, you get pages 
that are numbered at the bottom; and each page will be approximately 8^ inches 
wide and 11 inches tall, including 1-inch margins at all four sides. This format 
is suitable for preprints of technical papers, but you might well want to change 
it, especially if you are not using TeX to make a preprint of a technical paper. 

For example, we saw in the experiments of Chapter 6 that the width 
of the material on a page can be changed by giving a different value to the 
horizontal line size, \hsize. Plain TeX format says ‘\hsize=6 . 5in’, in order to 
obtain 8.5-inch pages with 1-inch margins; you can change \hsize to whatever 
you want. Similarly, you can control the vertical size of a page by changing 
\vsize. Plain TeX sets \vsize=8.9in (not 9in, since \vsize doesn’t include 
the space for page numbers at the bottom of each page); if you say ‘\vsize=4in’ 
you will get shorter pages, with only 4 inches of copy per sheet. It’s best not to 
monkey with \hsize and \vsize except at the very beginning of a job, or after 
you have ejected all pages from TeX’s memory. 

If you want your output to be positioned differently when it is ultimately 
printed, you can offset it by giving nonzero values to \hoffset and \voffset. 
For example, 

\hof f set= . 5in \vof f set=l . 5in 

will move the output half an inch to the right of its normal position, and 1.5 
inches down. You should be careful not to offset the output so much that it 
falls off the edge of the physical medium on which it is being printed, unless you 
know that such out-of-bounds activity won’t cause trouble. 

TeX is often used to typeset announcements, brochures, or other docu- 
ments for which page numbers are inappropriate. If you say 

\nopagenumbers 

at the beginning of your manuscript, plain TeX will refrain from inserting num- 
bers at the bottom of each page. 
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In fact, \nopagenumbers is a special case of a much more general mechanism 
by which you can control headings and footings. The plain TjjX output rou- 
tine puts out a special line of text, called the headline at the top of each page, and 
another special line of text called the footline at the bottom. The headline is normally 
blank, and the footline is normally a centered page number, but you can specify any 
headline and footline that you want by redefining the control sequences \headline and 
\footline. For example, 

\headline={\hrulef ill} 

will put a horizontal rule at the top of every page. The basic idea is that plain TjgX puts 
‘\line{\the\headline}’ at the top and ‘\line{\the\f ootline}’ at the bottom, with 
blank lines separating these extra lines from the other material. (Recall that Mine 
is an abbreviation for ‘\hbox to\hsize’; hence the headline and footline are put into 
boxes as wide as the normal lines on the page itself.) The normal value of \headline 
is ‘\hfil’, so that no heading is visible. The \nopagenumbers macro described earlier 
is simply an abbreviation for ‘\f ootline={\hf il}’. 

The normal value of \footline is ‘\hss\tenrm\f olio\hss’; this centers the 
page number on a line, using font \tenrm, because \folio is a control sequence 
that produces the number of the current page in text form. 

The page number appears in IjgX’s internal register \countO, as explained in 
Chapter 15, and plain TjrX makes \pageno an abbreviation for \countO. Thus 
you can say ‘\pageno=100’ if you want the next page of your output to be number 100. 
The \folio macro converts negative page numbers to roman numerals; if your manu- 
script begins with ‘\pageno=-l’, the pages will be numbered i, ii, iii, iv, v, etc. In fact, 
Appendix B defines \folio to be an abbreviation for 





\ifnum\pageno<0 \romannumeral-\pageno \else\number\pageno \fi 

f it is important to include the name of each font explicitly whenever you are 
defining a headline or footline, because an output routine in T^X can come 
into action at somewhat unpredictable times. For example, suppose that \f ootline 
had been set to ‘\hss\folio\hss’, without specifying \tenrm; then the page number 
would be typeset in whatever font happens to be current when TjrX decides to output a 
page. Mysterious effects can occur in such cases, because TgX is typically in the midst 
of page 101 when it is outputting page 100. 



► EXERCISE 23.1 

Explain how to put en-dashes around the page numbers in a plain T)?X job. 


For example, ‘ - 4 - 1 should appear at the bottom of page 4. 



Here is an example of a headline in which the page numbers appear at the top. 
Furthermore, odd-numbered and even-numbered pages are treated differently: 


\nopagenumbers '/, suppress footlines 

\headline={\ifodd\pageno\rightheadline \else\leftheadline\f i} 
\def\rightheadline{\tenrm\hf il EIGHT RUNNING HEAD\hf il\folio} 
\def\leftheadline{\tenrm\folio\hf il LEFT RUNNING HEAD\hf il} 
\voff set=2\baseline skip 
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English-language books traditionally have odd-numbered pages on the right and even- 
numbered pages on the left. Text that appears as a headline on several pages is often 
called a “running head.” When you use headlines, it is generally wise to set \voffset 
to the equivalent of two lines of text, as shown in this example, so that there will still 
be a margin of one inch at the top of your output pages. 

^ ►EXERCISE 23.2 

JL Suppose that you’re using IgX to typeset your resume, which is several pages 
long. Explain how to define \headline so that the first page is headed by ‘RESUME’, 
centered in boldface type, while each subsequent page has a headline like this: 


Resume of A. U. Thor 


Page 2 


f lf you don’t change the \vsize, all of the headlines and footlines will occur 
in the same place regardless of the contents of the page between them. Thus, 
for example, if you are using \raggedbottom as explained in Chapter 15, so that pages 
do not always contain the same amount of text, the raggedness will occur above the 
footline; the footline won’t move up. If you do change \vsize, the footline position 
will change correspondingly, while the headline will stay put. 

&)(&) The rest of this chapter is intended for people who want an output format 
JL JL that is substantially different from what plain TpX provides. Double dangerous 
bends are used in all of the subsequent paragraphs, because you should be familiar with 
the rest of TgX before you plunge into these final mysteries of the language. Chapter 22 
taught you how to be a TjrX Master, i.e. , a person who can produce complicated tables 
using \halign and \valign; the following material will take you all the way to the rank 
of Grandmaster, i.e., a person who can design output routines. When you are ready 
for this rank, you will be pleased to discover that — like alignments — output routines 
are not really so mysterious as they may seem at first. 

/gb/gb Let’s begin by recapping some of the rules at the end of Chapter 15. TgX 
JL JL periodically chooses to output a page of information, by breaking its main 
vertical list at what it thinks is the best place, and at such times it enters internal 
vertical mode and begins to read the commands in the current \output routine. When 
the output routine begins, \box255 contains the page that TjgX has completed; the 
output routine is supposed to do something with this vbox. When the output routine 
ends, the list of items that it has constructed in internal vertical mode is placed just 
before the material that follows the page break. In this way IjjX’s page-break decisions 
can effectively be changed: Some or all of the material on the broken-off page can be 
removed and carried forward to the next page. 

/>)/>) The current \output routine is defined as a token list parameter, just like 
JL JL \everypar or \errhelp, except that T^rX automatically inserts a begin-group 
symbol ‘{’ at the beginning and an end-group symbol at the end. These grouping 
characters help to keep the output routine from interfering with what TjjX was doing 
when the page break was chosen; for example, an output routine often changes the 
\baselineskip when it puts a headline or footline on a page, and the extra braces 
keep this change local. If no \output routine has been specified, or if the user has 
said ‘\output={}’, TpX supplies its own routine, which is essentially equivalent to 
‘\output={\shipout\box255}’; this outputs the page without any headline or footline, 
and without changing the page number. 
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/gK/gK TjjX’s primitive command \shipout{box) is what actually causes output. It 
JL JL sends the contents of the box to the dvi file, which is TgX’s main output file; 
after IjgX has finished, the dvi file will contain a compact device-independent encoding 
of instructions that specify exactly what should be printed. When a box is shipped out, 
TjgX displays the values of \countO through \count9 on your terminal, as explained 
in Chapter 15; these ten counters are also recorded in the dvi file, where they can be 
used to identify the page. All of the \openout, \closeout, and Write commands that 
appear inside of the (box) are performed in their natural order as that box is being 
shipped out. Since a Xwrite command expands macros, as explained in Chapter 21, 
IjgX’s scanning mechanism might detect syntax errors while a \shipout is in progress. 
If \tracingoutput is nonzero at the time of a \shipout, the contents of the (box) 
being shipped are written into your log file in symbolic form. You can say \shipout 
anywhere, not only in an output routine. 

/gb/gb The delayed aspect of Write imposes a noteworthy restriction: It is necessary 
JL JL to be sure that all macros that might appear within the text of a Write are 
properly defined when a \shipout command is given. For example, the plain T)?X for- 
mat in Appendix B temporarily makes spaces active and says ‘\global\let u =\space’; 
the reason is that \obey spaces might be in force during a Write command, so a defi- 
nition for u as an active character should exist during the next \shipout, even though 
IgX might no longer be making spaces active at that time. 

/gb/gb Chapter 15 points out that T|X gives special values to certain internal registers 
JL JL and parameters, in addition to \box255, just before the output routine begins. 
Insertions are put into their own vboxes, and \insertpenalties is set equal to the total 
number of lreldover insertions; furthermore the \outputpenalty parameter is set to the 
value of the penalty at the current breakpoint. An output routine can be made to do 
special things when these quantities have special values. For example, the output 
routine of plain TjjX recognizes a \supereject (which ejects all held-over insertions) 
by the fact that Wupereject causes \outputpenalty to be —20000, and by using 
\insertpenalties to decide if any insertions are being held over. 


/gb/gb The default output routine, ‘\shipout\box255’, illustrates one extreme in 
JL JL which nothing is put into the vertical list that is carried over to the next page. 
The other extreme is 


dvi 

countO 

openout 

closeout 

write 

tracingoutput 

space 
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outputpenalty 
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\output={\unvbox255 \penalty\output penalty} 

which ships nothing out and puts everything back onto the main vertical list. (The 
command ‘\unvbox255 1 takes the completed page out of its box, and the command 
‘\penalty\outputpenalty’ reinserts the penalty at the chosen breakpoint.) This 
makes a seamless join between the completed page and the subsequent material, be- 
cause TjjX has still not discarded glue and penalties at the breakpoint when it invokes 
an \output routine; hence Tj?X will go back and reconsider the page break. If the 
\vsize hasn’t changed, and if all insertions have been held in place, the same page 
break will be found; but it will be found much faster than before, because the vertical 
list has already been constructed — the paragraphing doesn’t need to be done again. Of 
course, an output routine like this makes IjgX spin its wheels endlessly, so it is of no 
use except as an example of an extreme case. 
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/>)/>) To prevent such looping, your output routine should always make progress 
JL JL of some sort whenever it comes into play. If you make a mistake, TgX may 
be able to help you diagnose the error, because a special loop-detection mechanism 
has been built in: There is an internal integer variable called \deadcycles, which 
is cleared to zero after every \shipout and increased by 1 just before every \output. 
Thus, \deadcycles keeps track of how many times an output routine has been initiated 
since the most recent \shipout, unless you change the value of \deadcycles yourself. 
There’s also an integer parameter called \maxdeadcycles, which plain TjjX sets to 25. 
If \deadcycles is greater than or equal to \maxdeadcycles when your output routine 
is about to be started (i.e., when \deadcycles is about to be increased), TgX issues an 
error message and performs the default output routine instead of yours. 

/gb/gb When your output routine is finished, \box255 should be void. In other words, 
JL JL you must do something with the information in that box; it should either be 
shipped out or put into some other place. Similarly, \box255 should be void when TgX 
is getting ready to fill it with a new page of material, just before starting an output 
routine. If \box255 is nonvoid at either of those times, T^X will complain that you are 
misusing this special register, and the register contents will be destroyed. 


/>>/>) But let’s not talk forever about borderline cases and special parameters; let’s 
JL JL look at some real examples. The output routine of plain TjvX, found in Ap- 
pendix B, is set up by saying l \output={\plainoutput}’, where \plainoutput is an 
abbreviation for 


\shipout\vbox{\makeheadline 

\pagebody 

\makefootline} 

\advancepageno 

\ifnum\outputpenalty>-20000 \else\dosupereject\f i 


Let us consider this “program” one line at a time: 

1) The \makeheadline macro constructs a vbox of height and depth zero in such 
a way that the headline is properly positioned above the rest of the page. Its actual 
code is 

\vbox to 0pt{\vskip-22 . 5pt 

\line{\vbox to8. 5pt{}\the\headline}\vss} 

\nointerlineskip 

The magic constant — 22.5pt is equal to (topskip — height of strut — 2(baselineskip)), 
i.e., 10 pt — 8.5 pt — 24 pt; this places the reference point of the headline exactly 24 pt 
above the reference point of the top line on the page, unless the headline or the top 
line are excessively large. 

2) The \pagebody macro is an abbreviation for 


\vbox to\vsize{\boxmaxdepth=\maxdepth \pagecontents} 
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The value of \boxmaxdepth is set to \maxdepth so that the vbox will be constructed 
under the assumptions that TjgX’s page builder has used to set up \box255. 
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3) The \pagecontents macro produces a vertical list for everything that belongs 
on the main body of the page, namely the contents of \box255 together with illustra- 
tions (inserted at the top) and footnotes (inserted at the bottom): 

\ifvoid\topins \else\unvbox\topins\f i 
\dimen0=\dp255 \unvbox255 

\ifvoid\f ootins\else '/, footnote info is present 
\vskip\skip\f ootins 
\footnoterule 
\unvbox\f ootins\f i 

\ifraggedbottom \kern-\dimenO \vfil \fi 

Here \topins and \footins are the insertion class numbers for the two kinds of inser- 
tions used in plain TjgX; if more classes of insertions are added, \pagecontents should 
be changed accordingly. Notice that the boxes are unboxed so that the glue coming 
from insertions can help out the glue on the main page. The \f ootnoterule macro in 
Appendix B places a dividing line between the page and its footnotes; it makes a net 
contribution of Opt to the height of the vertical list. Ragged-bottom setting is achieved 
by inserting infinite glue, which overpowers the stretchability of \topskip. 

4) The \makefootline macro puts \footline into its proper position: 

\baselineskip=24pt 

\line{\the\footline} 

5) The \advancepageno macro normally advances \pageno by +1; but if \pageno 
is negative (for roman numerals), the advance is by —1. The new value of \pageno will 
be appropriate for the next time the output routine is called into action. 

\ifnum\pageno<0 \global\advance\pageno by-1 
\else \global\advance\pageno by 1 \fi 

6) Finally, the \dosupereject macro is designed to clear out any insertions that 
have been held over, whether they are illustrations or footnotes or both: 

\ifnum\insertpenalties>0 

\line{} \kern-\topskip \nobreak 
\vf ill\ supereject \fi 

The mysterious negative \kern here cancels out the natural space of the \topskip 
glue that goes above the empty Mine; that empty line box prevents the \vfill from 
disappearing into a page break. The vertical list that results from \dosupereject is 
placed on IjgX’s list of things to put out next, just after the straggling insertions have 
been reconsidered as explained in Chapter 15. Hence another super-eject will occur, 
and the process will continue until no insertions remain. 

/£)/£)► EXERCISE 23.3 

JL JL Explain how to change the output routine of plain TjgX so that it will produce 
twice as many pages. The material that would ordinarily go on pages 1, 2, 3, etc., should 
go onto pages 1, 3, 5, ... ; and the even-numbered pages should be entirely blank except 
for the headline and footline. (Imagine that photographs will be mounted on those 
blank pages later.) 
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/gL/gL Suppose now that double-column format is desired. More precisely, let’s at- 
JL JL tempt to modify plain TjjX so that it sets type in columns whose width is 
\hsize=3.2in. Each actual page of output should contain two such columns separated 
by 0.1 in of space; thus the text area of each page will still be 6.5 inches wide. The 
headlines and footlines should span both columns, but the columns themselves should 
contain independent insertions as if they were the facing pages of a book. In other 
words, each column should contain its own footnotes and its own illustrations; we do 
not have to change the \pagebody macro. 

/gL/gL I n order to solve this problem, let us first introduce a new dimension register 
JL JL called \fullhsize that represents the width of an entire page. 

\newdimen\fullhsize 
\fullhsize=6. 5in \hsize=3.2in 
\def\fullline{\hbox to\fullhsize} 

The \makeheadline and \makef ootline macros should be modified so that they use 
‘\fullline’ instead of ‘\line’. 

/gL/gL The new output routine will make use of a control sequence \lr that is set 
JL JL to either ‘L’ or ‘R’, according as the next column belongs at the left or at the 
right of the next page. When a left column has been completed, the output routine 
simply saves it in a box register; when a right column has been completed, the routine 
outputs both columns and increases the page number. 

\let\lr=L \newbox\leftcolumn 
\output={\if L\lr 

\global\setbox\leftcolumn=\columnbox \global\let\lr=R 
\else \doubleformat \global\let\lr=L\f i 
\ifnum\outputpenalty>-20000 \else\dosupere ject\f i} 
\def\doubleformat{\shipout\vbox{\makeheadline 
\fullline{\box\leftcolumn\hf il\columnbox} 

\makef ootline} 

\advancepageno} 

\def\coluranbox{\leftline{\pagebody}} 

The \columnbox macro uses \leftline in order to ensure that it produces a box whose 
width is \hsize. The width of \box255 is usually, but not always, equal to \hsize at 
the beginning of an output routine; any other width would louse up the format. 

/gK/gK When double-column setting ends, there’s a 50-50 chance that the final column 
JL JL has fallen at the left, so it will not yet have been output. The code 
\supere ject 

\if R\lr \null\vf ill\eject\fi 

supplies an empty right-hand column in this case, ensuring that all of the accumulated 
material will be printed. It’s possible to do fancier column balancing on the last page, 
but the details are tricky if footnotes and other insertions need to be accommodated 
as well. Appendix E includes the macros that were used to balance the columns at the 
end of the index in Appendix I, and to start two-column format in mid-page. 

/gK/gK^ EXERCISE 23.4 

JL JL How should the example above be modified if you want three-column output? 
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/>)/>) Since IjjX’s output routine lags behind its page-construction activity, you can 
JL JL. get erroneous results if you change the \headline or the \footline in an 
uncontrolled way. For example, suppose that you are typesetting a book, and that the 
format you are using allows chapters to start in the middle of a page; then it would 
be a mistake to change the running headline at the moment you begin a new chapter, 
since the next actual page of output might not yet include anything from the new 
chapter. Consider also the task of typesetting a dictionary or a membership roster; a 
well-designed reference book displays the current range of entries at the top of each 
page or pair of pages, so that it is easy for readers to thumb through the book when they 
are searching for isolated words or names. But IjgX’s asynchronous output mechanism 
makes it difficult, if not impossible, to determine just what range of entries is actually 
present on a page. 

/gb/gb Therefore TjrX provides a way to put “marks” into a list; these marks inform 
JL JL the output routine about the range of information on each page. The general 
idea is that you can say 


\mark{{mark text)} 


in the midst of the information you are typesetting, where the (mark text) is a token 
list that is expanded as in the commands \edef, \message, etc. T^rX puts an internal 
representation of the mark text into the list it is building; then later on, when a 
completed page is packed into \box255, T|,’X allows the output routine to refer to the 
first and last mark texts on that page. 

/gb/gb The best way to think of this is probably to imagine that T^X generates an 
JL JL arbitrarily long vertical list of boxes, glue, and other items such as penalties 
and marks. Somehow that long vertical list gets divided up into pages, and the pages 
are made available to the output routine, one at a time. Whenever a page is put in 
\box255, TjgX sets up the value of three quantities that act essentially like macros: 

■ \botmark is the mark text most recently encountered on the page that was 
just boxed; 

■ \firstmark is the mark text that was first encountered on the page that was 
just boxed; 

■ \topmark has the value that \botmark had just before the current page was 
boxed. 


Before the first page, all three of these are null, i.e., they expand to nothing. When 
there is no mark on a page, all three are equal to the previous \botmark. 



page 2, 


For example, suppose that your manuscript includes exactly four marks, and 
that the pages are broken in such a way that \mark{cv} happens to fall on 
\mark{/3} and Xmarkfy} on page 4, and \mark{<5} on page 5. Then 
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/>)/>) When you use a \mark command in vertical mode, TjrX puts a mark into the 
JL JL main vertical list. When you use a \mark command in horizontal mode, TjgX 
treats it as vertical mode material like \vadjust and \insert; i.e., after the paragraph 
has been broken into lines, each mark will go into the main vertical list just after the 
box for the line where that mark originally appeared. If you use \mark in restricted 
horizontal mode, the mark may migrate out to the enclosing vertical list in the same 
way that \insert and \vadjust items do (see Chapter 24); but a mark that is locked 
too deeply inside a box will not migrate, so it will never appear as a \firstmark or 
\botmark. Similarly, a \mark that occurs in internal vertical mode goes into a vbox, 
and it is not accessible in the main vertical list. 

/gb/gb Chapter 15 discusses the \vsplit command, which allows you to break up 
JL JL vertical lists by yourself. This operation sometimes provides a useful alterna- 
tive to IjgX’s ordinary page-building mechanism. For example, if you simply want to 
typeset some material in two columns of equal height, you can put that material into 
a vbox, then \vsplit the box into two pieces; no output routine is needed at all. The 
\vsplit operation sets up the values of two macro-like quantities that were not men- 
tioned in Chapter 15: \splitf irstmark and \splitbotmark expand to the mark texts 
of the first and last marks that appear in the vertical list that was split off by the most 
recent \vsplit command. Both quantities are null if there were no such marks. The 
values of \topmark, \firstmark, \botmark, \splitf irstmark, and \splitbotmark are 
global; i.e., they are not affected by TjjX’s grouping mechanism. 

/>)/>) dictionaries use the equivalent of \f irstmark and \botmark to give guide 

JL JL words at the top of each pair of facing pages. For example, if the definition of 
the word ‘type’ starts on page 1387 and continues onto page 1388, the guide word on 
page 1387 (a right-hand page) will be ‘type’; but the guide word at the top of page 1388 
(a left-hand page) will be the next word in the dictionary (e.g. , ‘typecast’) even though 
the top of page 1388 is about ‘type’. 

/gb/gb The dictionary scheme works fine for dictionaries, since a reader should start 
JL JL reading each dictionary entry at its beginning. But a different scheme is 
appropriate for a technical book like the author’s Art of Computer Programming , 
where Section 1.2.8 (for example) starts in the middle of page 78, but the top of 
page 78 contains exercises 19-24 of Section 1.2.7. The headline at the top of page 78 
refers to ‘1.2.7’, because that will help somebody who is searching for exercise 1.2.7-22. 
Notice that the dictionary convention would put ‘1.2. 8’ at the top of page 78, but that 
would be appropriate only if Section 1.2.8 had begun exactly at the top of that page. 

&)(&) Continuing this example from The Art of Computer Programming, let’s sup- 
JL JL pose that the TjtX manuscript for Section 1.2.8 begins with a macro call like 

\beginsection 1.2.8. Fibonacci Numbers. 

How should \beginsection be defined? Here is one attempt: 


\def\beginsection #1. #2. 
{\sectionbreak 

\leftline{\sectionf ont #1. #2} 
\mark{#l} 

\nobreak\ smallskip\no indent} 


vadjust 

insert 

migrate 

vsplit 

splitfirstmark 
splitbotmark 
grouping 
guide words 

Art of Computer Programming 
Knuth 
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The \sectionbreak macro should encourage TjjX either to break the page at the current 
position, or to leave a goodly amount of blank space; e.g., \sectionbreak might be an 
abbreviation for ‘\penalty-200 \vskipl8pt plus4pt minus6pt’. The \beginsection 
macro ends with commands that suppress indentation of the first paragraph in the 
section. But the thing that concerns us with respect to output routines is the \mark 
command that follows \leftline. In the example we have been considering, the begin- 
ning of Section 1.2.8 would insert ‘\mark{l . 2 . 8}’ into the main vertical list just after 
the box containing the title of that section. 

/>)/>) Is such a \mark adequate? Unfortunately, no, not even if we assume for sim- 
JL JL plicity that at most one section begins on each page. The page that contains 
the beginning of Section 1.2.8 will then have \topmark=l . 2 . 7 and \f irstmark=l . 2 . 8, 
regardless of whether or not the section starts at the very top of the page. What we 
want in this application is a cross between \topmark and \firstmark: something that 
will reflect the mark text that represents the state of affairs just after the first line of 
the page. And TjrX doesn’t provide that. 


The solution is to emit the \mark just before the \sectionbreak, instead of 
just after the \leftline. Then \topmark will always reflect the truth about 
the section that is current at the top line. (Think about it.) 


/>)/>) However, the format for The Art of Computer Programming is more complex 
JL JL than this. On left-hand pages, the section number in the headline is sup- 
posed to reflect the situation at the top of the page, as we have just discussed, but on 
right-hand pages it is supposed to refer to the bottom of the page. Our solution to 
the previous problem made \topmark correct for the top, but it can make \botmark 
incorrect at the bottom. In order to satisfy both requirements, it is necessary to pack 
more information into the marks. Here’s one way to solve the problem: 

\def\beginsection #1. #2. 

{\mark{\currentsection \noexpand\else #1} 

\sectionbreak 

\leftline{\sectionf ont #1. #2} 

\mark{#l\noexpand\else #1} \def \currentsection{#l} 

\nobreak\ smallskip\no indent} 

\def\currentsection{} '/, the current section number 


iftrue 

expandafter 

iffalse 

Dull 


The idea is to introduce two marks, one just before the section break and one just after 
the section has begun. Furthermore each mark has two parts; the mark just before the 
potential break between Sections 1.2.7 and 1.2.8 is T.2.7\else 1.2.8’, while the one 
just after that potential break is T . 2 . 8\else 1.2.8’. It follows that the section number 
corresponding to the bottom of a page is the left component of \botmark; the section 
number corresponding to the top of a page is the right component of \topmark. The 
\rightheadline macro can make use of ‘\iftrue\botmark\f i’ to read the left compo- 
nent, and the \leftheadline macro can say ‘\expandafter\iffalse\topmark\f i’ to 
read the right component. 

<§><&>*■ EXERCISE 23.5 

JL JL B. C. Dull used a construction very much like the one above, but he put the 
second \mark just before the \leftline instead of just after it. What went wrong? 
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/>)/>>► EXERCISE 23.6 

JL JL The marks in the previous construction have the form ‘cAelse B\ where a 
and (3 are two independent pieces of information. The ‘\else’ makes it possible to 
select either a or f3 by means of \iftrue and \iffalse. Generalize this idea: Suppose 
that you have an application in which marks are supposed to carry five independent 
pieces of information, and that each mark has the form ‘ao\°r ai\or o^Xor as\or cm 1 . 
Explain how to select any one of the five cv’s from such a mark. 

Let’s conclude our discussion of output routines by considering an application 
to indexes, such as the index to this manual that appears in Appendix I. The 
most complicated entries in such an index will look something like this: 


Main entry, 4, 6, 8-10, 12, 14-16, 
18-22, 24-28, 30. 

first subsidiary entry, 1-3, 6, 10-11, 
15, 21, 24, 28. 

second subsidiary entry, 1, 3, 6-7, 
10, 15, 21, 25, 28, 31. 


Main entries and subsidiary entries are typeset ragged-right, with two ems of hanging 
indentation after the first line; subsidiary entries are indented one ern on the first line. 
Our goal will be to typeset such material from input that looks like this: 

\beginindex 


iftrue 

iffalse 

index marks 
index example 
ragged-right 


Main entry, 4, 6, 8—10, 12, 14—16, 18—22, 24—28, 30. 

\sub first subsidiary entry, 1 — 3, 6, 10 — 11, 15, 21, 24, 28. 

\sub second subsidiary entry, 1, 3, 6 — 7, 10, 15, 21, 25, '/, 

28, 31. 

\endindex 

where 1 . . . ’ stands for other entries. Each line of input normally specifies one main 
entry or one subsidiary entry; if an entry is so long that it doesn’t fit on a single input 
line, ‘u '/.’ is typed at the end of the line so that it merges with the following one. 

/gb/gb The interesting thing about this index problem is that it is desirable to set 
JL JL up a system of marks so that the output routine can insert special lines of 
text when an entry has been broken between columns or pages. For example, if a page 
break occurs between any of the six lines of typeset output shown above, the output 
routine should emit the special line 

Main entry ( continued ): 

and if a page break occurs within a subsidiary entry, an additional special line 
subsidiary entry ( continued ): 

should also appear. The solution below produces marks so that \botmark will be null if 
a break occurs between main entries; it will be ‘Main entry’ if a break occurs after lines 
1, 2, or 4 of the six example output lines; it will be ‘Main entry\sub first subsidiary 
entry’ if a break occurs after line 3 (within the first subsidiary entry); and it will be 
‘Main entry\sub second subsidiary entry’ if a break occurs after line 5. 
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/>>/>) The reader may wish to try solving this problem before looking at the solution, 
JL JL because it will then be easier to appreciate the subtler issues that are involved. 
(Go ahead: Try to define a macro \beginindex that does the ragged-right setting and 
produces the specified marks. Turn back to the previous page to study the problem 
carefully, before peeking at the answer.) 

\def\beginindex{\begingroup 

\parindent=lem \maxdepth=\maxdimen 
\def\par{\endgraf \futurelet\next\inxentry} 

\obeylines \everypar={\hangindent 2\parindent} 

\exhyphenpenalty= 10000 \raggedright} 

\def\inxentry{\ifx\next\sub \let\next=\ subentry 
\else\ifx\next\endindex \let\next=\vf ill 
\else\let\next=\mainentry \fi\fi \next} 
\def\endindex{\mark{}\break\endgroup} 

\let\sub=\indent \newtoks\maintoks \newtoks\subtoks 
\def \mainentry#l , {\mark{}\noindent 

\maintoks={#l}\mark{\the\maintoks}#l ,} 

\def\subentry\sub#l ,{\mark{\the\maintoks}\indent 

\subtoks={#l}\mark{\the\maintoks\sub\the\subtoks}#l ,} 

Even if you have read this solution, you probably want an explanation of what it does, 
because it uses “Tj^Xtics” that have not appeared before in this manual. 

1) The \beginindex macro uses \begingroup to keep other changes local; thus, 
it won’t be necessary to restore \par indent and \maxdepth, etc., to their former values 
when the index is finished. The \maxdepth parameter is set to \maxdimen, which is 
essentially infinite, so that \box255 will have the true depth of the last box that it 
contains; we will use this fact below. (It is safe to disable \maxdepth in this way, since 
the entries in an index can be assumed to have reasonably small depth.) Notice that 
\obeylines is used, so that \par will effectively be inserted at the end of every line of 
input. The meaning of \par is changed so that it does more than usual: First it does 
\endgraf , which is TjgX’s ordinary \par operation; then it sets \next to the first token 
of the next line, after which the macro \inxentry will be expanded. 

2) When \inxentry comes into play it looks at \next to decide what to do. 
There are three cases: If \next is ‘\sub’, the line will be treated as a subsidiary entry; 
if \next is ‘\endindex’, the next commands executed will be ‘\vf ill\mark{}\break 
\endgroup’; otherwise the line will be treated as a main entry. 

3) The text of a main entry is put into parameter #1 of \mainentry; this param- 
eter is delimited by a comma. The first thing that \mainentry does is l \mark{}’, which 
clears the mark in case of a break between entries. Then comes ‘\noindent’, which 
causes T^X to go into horizontal mode and to emit \parskip glue. (The \parskip 
glue will be a legal breakpoint between lines; it will later be followed by interline glue, 
when the first line of the main entry has been typeset by TeX’s paragraphing routine.) 
Then another \mark is put into the paragraph itself; this one contains the text of the 
main entry, and a \toks register called \maintoks is used to inhibit expansion of the 
mark text. When the paragraph is completed and broken into lines, this particular 
mark will immediately follow the box for the paragraph’s first line, so it will be the 
\botmark if a page break occurs anywhere within the paragraph. 


everypar 

futurelet 

exhyphenpenalty 

raggedright 

hangindent 

begingroup 

parindent 

maxdepth 
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noindent 

parskip 

toks 
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inhibit expansion 
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4) A similar construction is used for \subentry, but the mark is more compli- 
cated. The \maintoks register will still contain the main entry. The text for the 
subsidiary entry is added using another token list register, \subtoks. Since \sub has 
been defined to equal \indent, it will not be expanded in this \mark. 

The macros just defined will typeset entries that contain the necessary marks; 
JL JL now we must construct an output routine that uses these marks in the desired 
way, to insert new lines that say 1 (continued)' as mentioned above. Again, the reader 
is advised to try solving this problem before looking at the following solution. 

\output={\dimen0=\dp255 \normaloutput 
\expandaf ter\ inxche ck\botmark\ sub\end} 

\def\inxche ck#l\sub#2\end{\def \next{#l}'/, 

\ifx\next\empty '/, do nothing if \botmark is null 
\else\noindent #l\continued '/, ‘Main entry (continued) : ’ 

\def \next{#2}'/, 

\ifx\next\empty '/, nothing more if \botmark has no \sub 
\else\let\sub=\continued \indent #2\fi 
\advance\dimenO by-\prevdepth \kern\dimenO \fi} 

\def\continued{ ({\it continued}\thinspace) : \endgraf } 


This coding is a bit more subtle than usual. It assumes that \normaloutput takes care 
of shipping out \box255 (possibly putting it into multicolumn format) and advanc- 
ing the page number; then comes new stuff, which is performed by \ inxche ck. The 
\inxcheck macro is invoked in an interesting way that allows \botmark to be separated 
into its components. If \botmark is null, argument #1 to \inxcheck will be null; hence 
\next will be found equivalent to \empty. (Plain IjgX says ‘\def \empty{}’ in order to 
accommodate situations like this.) If \botmark doesn’t contain the token \sub, argu- 
ment #1 will be the contents of \botmark while #2 will be null. Otherwise, if \botmark 
lias the form a\sub /3, argument #1 will be a and #2 will be ‘/3\sub’. 

/>)/>) If \botmark isn’t null, the \ inxche ck macro produces one or more lines of text 
JL JL that will be contributed to T^X’s main vertical list at the position of the page 
break. And here’s where the most subtle point arises: There will be interline glue at 
the page break, computed on the basis of the depth of the box that preceded the break. 
That depth is known to the output routine, since it’s the depth of \box255. (The value 
of \maxdepth was made infinite for precisely this reason.) Therefore the \inxcheck 
macro can insert a \kern to compensate for the difference in depth between the old 
box and the one that will be inserted before the interline glue that has already been 
computed. Without this \kern, the spacing would be wrong. The reader should study 
this example carefully, to understand the reasoning behind the \kern command, before 
designing an output routine that inserts new boxes between random lines of output. 



► EXERCISE 23.7 

Modify this construction so that continuation lines are inserted only in the 


left columns of even-numbered pages, assuming two-column format. 



► EXERCISE 23.8 

True or false: The \ inxche ck macro in this example contributes at most two 


lines of output to the main vertical list. 


indent 

macro arguments 
empty 

interline glue 

maxdepth 

kern 
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/gK/gK When TgX sees an \end command, it terminates the job only if the main 
JL JL vertical list has been entirely output and if \deadcycles=0. Otherwise it 
inserts the equivalent of 

\line{} \vf ill \penalty- > 10000000000 

into the main vertical list, and prepares to read the ‘\end’ token again. This has the 
effect of invoking the output routine repeatedly until everything has been shipped out. 
In particular, the last column of two-column format will not be lost. 

/gK/gK It is possible to devise output routines that always leave a residue on the main 
JL JL vertical list, yet they never allow \deadcycles to increase. In such a case TjjX 
will never come to an end! An output routine can recognize that it is being invoked by 
TjgX’s endgame, because of the highly negative \outputpenalty caused by the special 
\penalty- ’ 10000000000. At such times the output routine should modify its behavior, 
if necessary, so that a happy ending will ensue. 


end 

deadcycles 

penalty-’ 10000000000 
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SHERIDAN 

LEONTIEF 


I think you will like them, 
when you shall see them on a beautiful quarto page, 
where a neat rivulet of text 
shall meander through a meadow of margin. 
'Fore Gad they will be the most elegant things of their kind! 
— RICHARD BRINSLEY SHERIDAN, The School for Scandal (1777) 

The influence of technical changes upon outputs 
through variation in the general investment level P 
is so small that actually it could have been neglected. 
WASSILY W. LEONTIEF, The Structure of American Economy, 1919-1929 (1941) 
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The whole TgX language has been presented in the previous chapters; we have fi- 
nally reached the end of our journey into previously uncharted territory. Hurray! 
Victory! Now it is time to take a more systematic look at what we have encoun- 
tered: to consider the facts in an orderly manner, rather than to mix them up 
with informal examples and applications as we have been doing. A child learns 
to speak a language before learning formal rules of grammar, but the rules of 
grammar come in handy later on when the child reaches adulthood. The purpose 
of this chapter — and of the two chapters that follow — is to present a precise and 
concise summary of the language that TgX understands, so that mature users 
will be able to communicate as effectively as possible with the machine. 

We will be concerned in these chapters solely with TpXcs primitive oper- 
ations, rather than with the higher-level features of plain TpX format that most 
people deal with. Therefore novice users should put off reading Chapters 24-26 
until they feel a need to know what goes on inside the computer. Appendix B 
contains a summary of plain T^X, together with a ready-reference guide to the 
things that most people want to know about TgX usage. The best way to get an 
overview of TgX from a high level is to turn to the opening pages of Appendix B. 

Our purpose here, however, is to survey the low-level parts of TjjX on which 
higher-level superstructures have been built, in order to provide a detailed reference 
for people who do need to know the details. The remainder of this chapter is set in 
small type, like that of the present paragraph, since it is analogous to material that is 
marked “doubly dangerous” in other chapters. Instead of using dangerous bend signs 
repeatedly, let us simply agree that Chapters 24-26 are dangerous by definition. 

TgX actually has a few features that didn’t seem to be worth mentioning in 
previous chapters, so they will be introduced here as part of our complete survey. If 
there is any disagreement between something that was said previously and something 
that will be said below, the facts in the present chapter and its successors should be 
regarded as better approximations to the truth. 

We shall study TjrX’s digestive processes, i.e., what IjgX does with the lists 
of tokens that arrive in its “stomach.” Chapter 7 has described the process by which 
input files are converted to lists of tokens in TjrX’s “mouth,” and Chapter 20 explained 
how expandable tokens are converted to unexpandable ones in TjrX’s “gullet” by a 
process similar to regurgitation. When unexpandable tokens finally reach IjgX’s gastro- 
intestinal tract, the real activity of typesetting begins, and that is what we are going 
to survey in these summary chapters. 

Each token that arrives in TjjX’s tummy is considered to be a command that 
the computer will obey. For example, the letter ‘L’ is a command to typeset an ‘L’ 
in the current font; ‘\par’ tells TjrX to finish a paragraph. TjgX is always in one of 
six modes, as described in Chapter 13, and a command sometimes means different 
things in different modes. The present chapter is about vertical mode (and internal 
vertical mode, which is almost the same): We shall discuss TjgX’s response to every 
primitive command, when that command occurs in vertical mode. Chapters 25 and 26 
characterize horizontal mode and math mode in a similar way, but those chapters are 
shorter than this one because many commands have the same behavior in all modes; 
the rules for such commands will not be repeated thrice, they will appear only once. 


vertical mode 

primitive 

truth 

anatomy of TeX 
command 
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Some commands have arguments. In other words, one or more of the tokens 
that follow a command might be used to modify that command’s behavior, and those 
tokens are not considered to be commands themselves. For example, when TjgX pro- 
cesses the sequence of tokens that corresponds to ‘\dimen2=2 . 5pt’, it considers only 
the first token ‘\dimen’ to be a command; the next tokens are swept up as part of the 
operation, because T^X needs to know what \dimen register is to be set equal to what 
(dimen) value. 

We shall define l^rX’s parts of speech by using a modified form of the gram- 
matical notation that was introduced about 1960 by John Backus and Peter Naur for 
the definition of computer languages. Quantities in angle brackets will either be ex- 
plained in words or they will be defined by syntax rules that show exactly how they 
are formed from other quantities. For example, 

(unit of measure) — > (optional spaces) (internal unit) 

| (optional true) (physical unit) 

defines a (unit of measure) to be either an occurrence of (optional spaces) followed by 
an (internal unit), or (optional true) followed by (physical unit). The symbol 1 — > ’ 
in a syntax rule means “is defined to be,” and 1 | ’ means “or.” 

Sometimes a syntax rule is recursive, in the sense that the right-hand side of 
the definition involves the quantity being defined. For example, the rule 

(optional spaces) — > (empty) | (space token) (optional spaces) 

defines the grammatical quantity called (optional spaces) to be either (empty), or a 
(space token) followed by (optional spaces). The quantity (empty) stands for “noth- 
ing,” i.e., for no tokens at all; hence the syntax rule just given is a formalized way of 
saying that (optional spaces) stands for a sequence of zero or more spaces. 

The alternatives on the right-hand side of a syntax rule need not consist 
entirely of quantities in angle brackets. Explicit tokens can be used as well. For 
example, the rule 

(plus or minus) — >■ +12 | -12 

says that (plus or minus) stands for a character token that is either a plus sign or 
a minus sign, with category code 12. 

We shall use a special convention for keywords, since the actual syntax of a 
keyword is somewhat technical. Letters in typewriter type like 1 pt ’ will stand for 

(optional spaces) (p or P)(t or T), 

where (p or P) denotes any non-active character token for either p or P (independent 
of the category code), and where (t or T) is similar. 

When a control sequence like ‘\dimen’ is used in the syntax rules below, it 
stands for any token whose current meaning is the same as the meaning that \dimen 
had when Tj?X started up. Other tokens can be given this same meaning, using \let 
or \futurelet, and the meaning of the control sequence \dimen itself may be redefined 
by the user, but the syntax rules take no note of this; they just use ‘\dimen’ as a 
way of referring to a particular primitive command of Jj^X. (This notation is to be 
distinguished from ‘ |dimen| ’, which stands for the control sequence token whose actual 
name is dimen; see Chapter 7.) 
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Control sequences sometimes masquerade as characters, if their meaning has 
been assigned by \let or \futurelet. For example, Appendix B says 

\let\bgroup={ \let\egroup=} 

and these commands make \bgroup and \egroup act somewhat like left and right curly 
braces. Such control sequences are called “implicit characters” ; they are interpreted in 
the same way as characters, when IgX acts on them as commands, but not always when 
they appear in arguments to commands. For example, the command ‘\let\plus=+’ 
does not make \plus an acceptable substitute for the character token ‘+ 12 ’ in the 
syntax rule for (plus or minus) given above, nor does the command ‘\let\p=p’ make 
\p acceptable as part of the keyword pt . When TJjX’s syntax allows both explicit and 
implicit characters, the rules below will be careful to say so, explicitly. 

The quantity (space token), which was used in the syntax of (optional spaces) 
above, stands for an explicit or implicit space. In other words, it denotes either a 
character token of category 10, or a control sequence or active character whose current 
meaning has been made equal to such a token by \let or \futurelet. 

It will be convenient to use the symbols ‘{’, ‘}\ and ‘$ ! to stand for any 
explicit or implicit character tokens of the respective categories 1 , 2 , and 3 , whether or 
not the actual character codes are braces or dollar signs. Thus, for example, plain T^X’s 
\bgroup is an example of a and so are the tokens ‘-Ci 1 and ‘ (i but ‘{12’ is not. 

The last few paragraphs can be summarized by saying that the alternatives 
on the right-hand sides of TgX’s formal syntax rules are made from one or more of the 
following things: (1) syntactic quantities like (optional spaces); (2) explicit character 
tokens like +12; ( 3 ) keywords like pt ; ( 4 ) control sequence names like \dimen; or ( 5 ) the 
special symbols {, }, $. 

Let us begin our study of T^X’s syntax by discussing the precise meanings of 
quantities like (number), (dimen), and (glue) that occur frequently as arguments to 
commands. The most important of these is (number), which specifies an integer value. 
Here’s exactly what a (number) is: 

(number) — J- (optional signs) (unsigned number) 

(optional signs) — > (optional spaces) 

| (optional signs) (plus or minus) (optional spaces) 

(unsigned number) — > (normal integer) | (coerced integer) 

(normal integer) — > (internal integer) 

| (integer constant) (one optional space) 

| ’ 12 (octal constant) (one optional space) 

| "12 (hexadecimal constant) (one optional space) 

| ‘ 12 (character token) (one optional space) 

(integer constant) — > (digit) | (digit) (integer constant) 

(octal constant) — > (octal digit) | (octal digit) (octal constant) 

(hexadecimal constant) — > (hex digit) | (hex digit) (hexadecimal constant) 
(octal digit) — > O12 | I12 | 2i 2 | 3 i 2 | 4 i 2 | 5i2 | 612 | 7 i 2 
(digit) — >■ (octal digit) | 812 | 9 r 2 
(hex digit) — » (digit) | An | Bn | Cn | Du | Eu | F11 
I A12 I B12 I C12 I D12 I E12 I F 12 
(one optional space) — > (space token) | (empty) 

(coerced integer) — >■ (internal dimen) | (internal glue) 
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The value of a (number) is the value of the corresponding (unsigned number), times — 1 
for every minus sign in the (optional signs) . An alphabetic constant denotes the char- 
acter code in a (character token); T^X does not expand this token, which should either 
be a (character code, category code) pair, or an active character, or a control se- 
quence whose name consists of a single character. (See Chapter 20 for a complete 
list of all situations in which T)?X does not expand tokens.) An (integer constant) 
must not be immediately followed by a (digit); in other words, if several digits appear 
consecutively, they are all considered to be part of the same (integer constant). A 
similar remark applies to the quantities (octal constant) and (hexadecimal constant). 
The quantity (one optional space) is (empty) only if it has to be; i.e. , T^X looks for 
(one optional space) by reading a token and backing up if a (space token) wasn’t there. 

EXERCISE 24.1 

JL JL Can you think of a reason why you might want ‘Ai 2 ? to be a (hex digit) even 
though the letter A has category 11? (Don’t worry if your answer is “no.” ) 

The definition of (number) is now complete except for the three quantities 
called (internal integer), (internal dimen), and (internal glue), which will be explained 
later; they represent things like parameters and registers. For example, \ count 1 
and \tolerance and \hyphenchar\tenrm are internal integers; \dimenl0 and \hsize 
and \f ontdimen6\tenrm are internal dimensions; XskiplOO and \baselineskip and 
\lastskip are internal glue values. An internal dimension can be “coerced” to be an 
integer by assuming units of scaled points. For example, if \hsize=100pt and if \hsize 
is used in the context of a (number), it denotes the integer value 6553600. Similarly, an 
internal glue value can be coerced to be an integer by first coercing it to be a dimension 
(omitting the stretchability and shrinkability), then coercing that dimension. 

Let’s turn now to the syntax for (dimen), and for (mudimen) its cousin: 

(dimen) — )• (optional signs) (unsigned dimen) 

(unsigned dimen) — )• (normal dimen) | (coerced dimen) 

(coerced dimen) — l (internal glue) 

(normal dimen) — (internal dimen) | (factor) (unit of measure) 

(factor) — > (normal integer) | (decimal constant) 

(decimal constant) — >• .12 | ,12 
| (digit) (decimal constant) 

| (decimal constant) (digit) 

(unit of measure) — t (optional spaces) (internal unit) 

| (optional true) (physical unit) (one optional space) 

(internal unit) — >- em (one optional space) | ex (one optional space) 

| (internal integer) | (internal dimen) | (internal glue) 

(optional true) — 1 true | (empty) 

(physical unit) — > pt | pc | in | bp | cm | mm | dd | cc | sp 
(mudimen) — >■ (optional signs) (unsigned mudimen) 

(unsigned mudimen) — )• (normal mudimen) | (coerced mudimen) 

(coerced mudimen) — >- (internal niuglue) 

(normal mudimen) — » (factor) (mu unit) 

(mu unit) — t (optional spaces) (internal niuglue) | mu (one optional space) 

When ‘true’ is present, the factor is multiplied by 1000 and divided by the \mag 
parameter. Physical units are defined in Chapter 10; mu is explained in Chapter 18. 
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Encouraged by our success in mastering the precise syntax of the quantities 
(number), (dimen), and (mudimen), let’s tackle (glue) and (muglue): 

(glue) — > (optional signs) (internal glue) 

| (dimen) (stretch) (shrink) 

(stretch) — > plus (dimen) | plus (fil dimen) | (optional spaces) 

(shrink) — > minus (dimen) | minus (fil dimen) | (optional spaces) 

(fil dimen) — > (optional signs) (factor) (fil unit) (optional spaces) 

(fil unit) — > fil | (fil unit) 1 

(muglue) — » (optional signs) (internal muglue) 

| (mudimen) (mustretcli) (mushrink) 

(mustretcli) — )• plus (mudimen) | plus (fil dimen) | (optional spaces) 
(mushrink) — > minus (mudimen) | minus (fil dimen) | (optional spaces) 

T|.'X makes a large number of internal quantities accessible so that a format 
designer can influence T^X’s behavior. Here is a list of all these quantities, except for 
the parameters (which will be listed later). 

(internal integer) — )• (integer parameter) | (special integer) | \lastpenalty 
| (countdef token) | \count (8-bit number) | (codename) (8-bit number) 

| (cliardef token) | (matlichardef token) | \par shape | \inputlineno 
| \hyphenchar(font) | \skewchar(font) | \badness 
(special integer) — > \spacefactor | \prevgraf 
| \deadcycles | \insertpenalties 
(codename) — > \catcode | \mathcode 

| \lccode | \uccode | \sfcode | \delcode 
(font) — l (fontdef token) | \font | (family member) 

(family member) — > (font range) (4-bit number) 

(font range) — > \textfont | \scriptfont | \scriptscriptfont 
(internal dimen) — (dimen parameter) | (special dimen) | \lastkern 
| (dimendef token) | \dimen(8-bit number) 

| (box dimension) (8-bit number) | \fontdimen(number)(font) 

(special dimen) — 1 \prevdepth | \pagegoal | \pagetotal 

| \pagestretch | \pagef ilstretch | \pagef illstretch 
| \pagef illlstretch | \pageshrink | \pagedepth 
(box dimension) — > \ht | \wd | \dp 
(internal glue) — » (glue parameter) | \lastskip 
| (skipdef token) | \skip(8-bit. number) 

(internal muglue) — >■ (muglue parameter) | \lastskip 
| (muskipdef token) | \muskip(8-bit number) 

A (countdef token) is a control sequence token in which the control sequence’s current 
meaning has been defined by \ countdef; the other quantities (dimendef token), etc., 
are defined similarly. A (fontdef token) refers to a definition by \font, or it can be 
the predefined font identifier called \nullfont. When a (countdef token) is used as an 
internal integer, it denotes the value of the corresponding \ count register, and similar 
statements hold for (dimendef token), (skipdef token), (muskipdef token). When a 
(cliardef token) or (matlichardef token) is used as an internal integer, it denotes the 
value in the \chardef or \mathchardef itself. An (8-bit number) is a (number) whose 
value is between 0 and 2 s — 1 = 255; a (4-bit number) is similar. 
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TjjX allows \spacefactor to be an internal integer only in horizontal modes; 
\prevdepth can be an internal dimension only in vertical modes; \lastskip can be 
(internal muglue) only in math mode when the current math list ends with a miiglue 
item; and \lastskip cannot be (internal glue) in such a case. When \parshape is used 
as an internal integer, it denotes only the number of controlled lines, not their sizes or 
indentations. The seven special dimensions \pagetotal, \pagestretch, and so on are 
all zero when the current page contains no boxes, and \pagegoal is \maxdimen at such 
times (see Chapter 15). 

From the syntax rules just given, it’s possible to deduce exactly what hap- 
pens to spaces when they are in the vicinity of numerical quantities: IjgX allows 
a (number) or (dimen) to be preceded by arbitrarily many spaces, and to be fol- 
lowed by at most one space; however, there is no optional space after a (number) 
or (dimen) that ends with an unexpandable control sequence. For example, if TgX 
sees ‘\space\space24\space\space’ when it is looking for a (number), it gobbles up 
the first three spaces, but the fourth one survives; similarly, one space remains when 
‘24pt\space\space’ and ‘\dimen24\space\space’ and ‘\pagegoal\space’ are treated 
as (dimen) values. 

EXERCISE 24.2 

X X Is ‘24\space\space pt’ a legal (dimen)? 


<^> .►EXERCISE 24.3 

Is there any difference between ‘+\baselineskip’, 
‘lAbaselineskip’, when TjjX reads them as (glue)? 


-\baselineskip’, and 


/>)/>>► EXERCISE 24.4 

JL JL What (glue) results from "DD DDPLUS2,5 \spacef actor\space, assuming the 
conventions of plain Tj?X, when \spacefactor equals 1000? 


Let’s turn now to TpjX’s parameters, which the previous chapters have in- 
troduced one at a time; it will be convenient to assemble them all together. An 
(integer parameter) is one of the following tokens: 

\pretolerance (badness tolerance before hyphenation) 

\tolerance (badness tolerance after hyphenation) 

\hbadness (badness above which bad hboxes will be shown) 

\vbadness (badness above which bad vboxes will be shown) 

\linepenalty (amount added to badness of every line in a paragraph) 
\hyphenpenalty (penalty for line break after discretionary hyphen) 
\exhyphenpenalty (penalty for line break after explicit hyphen) 
\binoppenalty (penalty for line break after binary operation) 

\relpenalty (penalty for line break after math relation) 

\clubpenalty (penalty for creating a club line at bottom of page) 
Xwidowpenalty (penalty for creating a widow line at top of page) 
\displaywidowpenalty (ditto, before a display) 

\brokenpenalty (penalty for page break after a hyphenated line) 
\predisplaypenalty (penalty for page break just before a display) 
\postdisplaypenalty (penalty for page break just after a display) 
\interlinepenalty (additional penalty for page break between lines) 
\floatingpenalty (penalty for insertions that are split) 
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\outputpenalty (penalty at the current, page break) 
\doublehyphendemerits (demerits for consecutive broken lines) 

\f inalhyphendemerits (demerits for a penultimate broken line) 

\adj demerits (demerits for adjacent incompatible lines) 

\looseness (change to the number of lines in a paragraph) 

\pausing (positive if pausing after each line is read from a file) 
\holdinginserts (positive if insertions remain dormant, in output box) 
\tracingonline (positive if showing diagnostic info on the terminal) 
\tracingmacros (positive if showing macros as they are expanded) 
\tracingstats (positive if showing statistics about memory usage) 
\tracingparagraphs (positive if showing line-break calculations) 
\tracingpages (positive if showing page-break calculations) 

\tracingoutput (positive if showing boxes that, are shipped out) 
\tracinglostchars (positive if showing characters not. in the font) 
\tracingcommands (positive if showing commands before they are executed) 
\tracingrestores (positive if showing deassignments when groups end) 
\language (the current set of hyphenation rules) 

\uchyph (positive if hyphenating words beginning with capital letters) 
\lefthyphenmin (smallest fragment at beginning of hyphenated word) 
\righthyphenmin (smallest fragment at end of hyphenated word) 
\globaldefs (nonzero if overriding \global specifications) 
\defaulthyphenchar (\hyphenchar value when a font, is loaded) 
\defaultskewchar (\skewchar value when a font is loaded) 

\escapechar (escape character in the output, of control sequence tokens) 
\endlinechar (character placed at the right end of an input line) 
\newlinechar (character that starts a new output line) 

\maxdeadcycles (upper bound on \deadcycles) 

\hangafter (hanging indentation changes after this many lines) 

\fam (the current family number) 

\mag (magnification ratio, times 1000) 

\delimiterf actor (ratio for variable delimiters, times 1000) 

\time (current time of day in minutes since midnight) 

\day (current day of the month) 

\month (current month of the year) 

\year (current year of our Lord) 

\showboxbreadth (maximum items per level when boxes are shown) 
\showboxdepth (maximum level when boxes are shown) 
\errorcontextlines (maximum extra context shown when errors occur) 

The first few of these parameters have values in units of “badness” and “penalties” 
that affect line breaking and page breaking. Then come demerit-oriented parameters; 
demerits are essentially given in units of “badness squared,” so those parameters tend to 
have larger values. By contrast., the next few parameters (\looseness, \pausing, etc.) 
generally have quite small values (either —1 or 0 or 1 or 2). Miscellaneous parameters 
complete the set. TjjX computes the date and time when it begins a job, if the operating 
system provides such information; but afterwards the clock does not keep ticking: The 
user can change \time just like any ordinary parameter. Chapter 10 points out that 
\mag must not. be changed after TjrX is committed to a particular magnification. 
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A {dimen parameter) is one of the following: 

\hfuzz (maximum overrun before overfull libox messages occur) 

\vfuzz (maximum overrun before overfull vbox messages occur) 
\overfullrule (width of rules appended to overfull boxes) 
\emergencystretch (reduces badnesses on final pass of line-breaking) 
\hsize (line width in horizontal mode) 

\vsize (page height in vertical mode) 

\maxdepth (maximum depth of boxes on main pages) 

\splitmaxdepth (maximum depth of boxes on split pages) 

\boxmaxdepth (maximum depth of boxes on explicit pages) 
\lineskiplimit (threshold where \baselineskip changes to \lineskip) 
\delimitershortfall (maximum space not covered by a delimiter) 
\nulldelimiterspace (width of a null delimiter) 

\scriptspace (extra space after subscript or superscript) 

\mathsurround (kerning before and after math in text) 

\predisplaysize (length of text preceding a display) 

\displaywidth (length of line for displayed equation) 

\displayindent (indentation of line for displayed equation) 

\parindent (width of \indent) 

\hangindent (amount of hanging indentation) 

\hoffset (horizontal offset in \shipout) 

\voffset (vertical offset in \shipout) 

And the possibilities for (glue parameter) are: 

\baselineskip (desired glue between baselines) 

\lineskip (interline glue if \baselineskip isn’t feasible) 

\parskip (extra glue just above paragraphs) 

\abovedisplayskip (extra glue just above displays) 
\abovedisplayshortskip (ditto, following short lines) 
\belowdisplayskip (extra glue just below displays) 
\belowdisplayshortskip (ditto, following short lines) 

\leftskip (glue at left of justified lines) 

\rightskip (glue at right of justified lines) 

\topskip (glue at top of main pages) 

\splittopskip (glue at top of split pages) 

\tabskip (glue between aligned entries) 

\spaceskip (glue between words, if nonzero) 

\xspaceskip (glue between sentences, if nonzero) 

\parfillskip (additional \rightskip at end of paragraphs) 

Finally, there are three permissible (muglue parameter) tokens: 

\thinmuskip (thin space in math formulas) 

\medmuskip (medium space in math formulas) 

\thickmuskip (thick space in math formulas) 
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All of these quantities are explained in more detail somewhere else in this book, and 
you can use Appendix I to find out where. 
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T|-'X also lias parameters that are token lists. Such parameters do not enter 
into the definitions of (number) and such things, but we might as well list them now 
so that our tabulation of parameters is complete. A (token parameter) is any of: 

\output (the user’s output routine) 

\everypar (tokens to insert when a paragraph begins) 

\everymath (tokens to insert when math in text begins) 

\everydisplay (tokens to insert when display math begins) 

\everyhbox (tokens to insert when an hbox begins) 

\everyvbox (tokens to insert when a vbox begins) 

\everyjob (tokens to insert when the job begins) 

\everycr (tokens to insert after every \cr or nonredundant \crcr) 
\errhelp (tokens that supplement an \errmessage) 

That makes a total of 103 parameters of all five kinds. 

EXERCISE 24.5 

JL JL Explain how \everyjob can be non-null when a job begins. 

It’s time now to return to our original goal, namely to study the commands 
that are obeyed by TjrX’s digestive organs. Many commands are carried out in the same 
way regardless of the current mode. The most important commands of this type are 
called assignments, since they assign new values to the meaning of control sequences or 
to TjgX’s internal quantities. For example, ‘\def\a{a}’ and ‘\parshape=l 5pt lOOpt’ 
and ‘\advance\count20 by-1’ and ‘\font\ff = cmff at 20pt’ are all assignments, 
and they all have the same effect in all modes. Assignment commands often include 
an = sign, but in all cases this sign is optional; you can leave it out if you don’t mind 
the fact that the resulting TjrX code might not look quite like an assignment. 

(assignment) — >■ (non-macro assignment) | (macro assignment) 

(non-macro assignment) — >• (simple assignment) 

| \global (non-macro assignment) 

(macro assignment) — (definition) | (prefix) (macro assignment) 

(prefix) — > \global | \long | \outer 

(equals) — > (optional spaces) | (optional spaces) =12 

This syntax shows that every assignment can be prefixed by \global, but only macro- 
definition assignments are allowed to be prefixed by \long or \outer. Incidentally, if the 
\globaldef s parameter is positive at the time of the assignment, a prefix of \global is 
automatically implied; but if \globaldefs is negative at the time of the assignment, a 
prefix of \global is ignored. If \globaldef s is zero (which it usually is), the appearance 
or nonappearance of \global determines whether or not a global assignment is made. 

(definition) — > (def) (control sequence) (definition text) 

(def) — > \def | \gdef | \edef | \xdef 

(definition text) — > (parameter text) (left brace) (balanced text) (right brace) 

Here (control sequence) denotes a token that is either a control sequence or an active 
character; (left brace) and (right brace) are explicit character tokens whose category 
codes are respectively of types 1 and 2. The (parameter text) contains no (left brace) 
or (right brace) tokens, and it obeys the rules of Chapter 20. All occurrences of 
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(left brace) and (right brace) tokens within the (balanced text) must be properly nested 
like parentheses. A \gdef command is equivalent to \global\def , and \xdef is equiv- 
alent to \global\edef . T{rX reads the (control sequence) and (parameter text) tokens 
and the opening (left brace) without expanding them; it expands the (balanced text) 
(right brace) tokens only in the case of \edef and \xdef. 

Several commands that we will study below have a syntax somewhat like that 
of a definition, but the (parameter text) is replaced by an arbitrary sequence of spaces 
and ‘\relax’ commands, and the (left brace) token can be implicit: 

(filler) — > (optional spaces) | (filler)\relax(optional spaces) 

(general text) — > (filler) {(balanced text) (right brace) 

The main purpose of a (general text) is to specify the (balanced text) inside. 

Many different kinds of assignments are possible, but they fall into compara- 
tively few patterns, as indicated by the following syntax rules: 

(simple assignment.) — > (variable assignment) | (arithmetic) 

| (code assignment) | (let assignment) | (shorthand definition) 

| (font.def token) | (family assignment) | (shape assignment) 

| \read(number) to (optional spaces) (control sequence) 

| \setbox (8-bit number) (equals) (filler) (box) 

| \font (control sequence) (equals) (file name) (at clause) 

| (global assignment) 

(variable assignment) — >• (integer variable) (equals) (number) 

| (dimen variable) (equals) (dimen) 

| (glue variable) (equals) (glue) 

| (muglue variable) (equals) (muglue) 

| (token variable) (equals) (general text) 

| (token variable) (equals) (filler) (token variable) 

(arithmetic) — >• \ advance (integer variable) (optional by) (number) 

| \advance(dimen variable) (optional by)(dimen) 

| \advance(glue variable) (optional by) (glue) 

| \ advance (muglue variable) (optional by) (muglue) 

| \multiply (numeric variable) (optional by) (number) 

| \di vide (numeric variable) (optional by) (number) 

(optional by) — > by | (optional spaces) 

(integer variable) — >• (integer parameter) | (countdef token) 

| \ count (8-bit number) 

(dimen variable) — >• (dimen parameter) | (dimendef token) 

| \dimen(8-bit number) 

(glue variable) — > (glue parameter) | (skipdef token) 

| \ skip (8-bit number) 

(muglue variable) — >- (muglue parameter) | (muskipdef token) 

| \muskip (8-bit number) 

(token variable) — > (token parameter) | (toksdef token) 

| \toks (8-bit. number) 

(numeric variable) — > (integer variable) | (dimen variable) 

| (glue variable) | (muglue variable) 
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(code assignment) — > (code name) (8-bit number) (equals) (number) 

(let assignment) — > \futurelet (control sequence) (token) (token) 

| \let(control sequence) (equals) (one optional space) (token) 

(shorthand definition) — > \chardef (control sequence) (equals) (8-bit number) 

| \mathchardef (control sequence) (equals) (15-bit number) 

| (registerdef ) (control sequence) (equals) (8-bit number) 

(registerdef) — > \countdef | \dimendef | \skipdef | \muskipdef | \toksdef 
(family assignment) — >■ (family member) (equals) (font) 

(shape assignment) — > \parshape (equals) (number) (shape dimensions) 

The (number) at the end of a (code assignment) must not be negative, except in the 
case that a \delcode is being assigned. Furthermore, that (number) should be at most 
15 for \catcode, 32768 for \mathcode, 255 for \lccode or \uccode, 32767 for \sfcode, 
and 2 24 — 1 for \delcode. In a (shape assignment) for which the (number) is n, the 
(shape dimensions) are (empty) if n < 0, otherwise they consist of 2 n consecutive 
occurrences of (dimen). TjgX does not expand tokens when it scans the arguments of 
\let and \futurelet. 

EXERCISE 24.6 

JL JL We discussed the distinction between explicit and implicit character tokens 
earlier in this chapter. Explain how you can make the control sequence \cs into an 
implicit space, using (a) \futurelet, (b) \let. 

All of the assignments mentioned so far will obey TpX 's grouping structure; 
i.e., the changed quantities will be restored to their former values when the current 
group ends, unless the change was global. The remaining assignments are different, 
since they affect TjgX’s global font tables or hyphenation tables, or they affect certain 
control variables of such an intimate nature that grouping would be inappropriate. In 
all of the following cases, the presence or absence of \global as a prefix has no effect. 

(global assignment) — > (font assignment.) 

| (hyphenation assignment) 

| (box size assignment) 

| (interaction mode assignment) 

| (intimate assignment) 

(font assignment) — > Montdimen(number) (font) (equals) (dimen) 

| \hyphenchar (font) (equals) (number) 

| \skewchar (font) (equals) (number) 

(at clause) — > at (dimen) | scaled (number) | (optional spaces) 
(hyphenation assignment) — > \hyphenation(general text) 

| \patterns (general text) 

(box size assignment) — > (box dimension) (8-bit number) (equals) (dimen) 
(interaction mode assignment) — > \errorstopmode | \scrollmode 
| \nonstopmode | \batchmode 

(intimate assignment) — (special integer) (equals) (number) 

| (special dimen) (equals) (dimen) 

When a \fontdimen value is assigned, the (number) must be positive and not greater 
than the number of parameters in the font’s metric information file, unless that font 
information has just been loaded into TjrX’s memory; in the latter case, you are allowed 
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to increase the number of parameters (see Appendix F). The (special integer) and 
(special dimen) quantities were listed above when we discussed internal integers and 
dimensions. When \prevgraf is set to a (number), the number must not be negative. 

The syntax for (file name) is not standard in TjrjX, because different operating 
systems have different conventions. You should ask your local system wizards for 
details on just how they have decided to implement file names. However, the following 
principles should hold universally: A (file name) should consist of (optional spaces) 
followed by explicit character tokens (after expansion). A sequence of six or fewer 
ordinary letters and/or digits followed by a space should be a file name that works in 
essentially the same way on all installations of TjgX. Uppercase letters are not considered 
equivalent to their lowercase counterparts in file names; for example, if you refer to fonts 
cmrlO and CMR10, T)?X will not notice any similarity between them, although it might 
input the same font metric file for both fonts. 

TjgX takes precautions so that constructions like ‘\chardef\cs=10\cs’ and 
‘\font\cs=name\cs’ won’t expand the second \cs until the assignments are done. 

Our discussion of assignments is complete except that the \setbox assignment 
involves a quantity called (box) that has not yet been defined. Here is its syntax: 

(box) — > \box(8-bit number) | \copy(8-bit number) 

| \lastbox | \vsplit(8-bit number) to (dimen) 

| \hbox(box specification) {(horizontal mode material)} 

| \vbox(box specification) {(vertical mode material)} 

| \vtop(box specification) {(vertical mode material)} 

(box specification) — > to (dimen) (filler) 

| spread (dimen) (filler) | (filler) 

The \lastbox operation is not permitted in math modes, nor is it allowed in vertical 
mode when the main vertical list has been entirely contributed to the current page. 
But it is allowed in horizontal modes and in internal vertical mode; in such modes it 
refers to (and removes) the last item of the current list, provided that the last item is 
an hbox or vbox. 

The three last alternatives for a (box) present us with a new situation: The 
(horizontal mode material) in an \hbox and the (vertical mode material) in a \vbox 
can’t simply be swallowed up in one command like an (8-bit number) or a (dimen); 
thousands of commands may have to be executed before that box is constructed and 
before the \setbox command can be completed. 

Here’s what really happens: A command like 

\setbox(number)=\hbox to (dimen){ (horizontal mode material)} 

causes TjgX to evaluate the (number) and the (dimen), and to put those values on 
a “stack” for safe keeping. Then TpX reads the ‘{’ (which stands for an explicit or 
implicit begin-group character, as explained earlier), and this initiates a new level of 
grouping. At this point TjrX enters restricted horizontal mode and proceeds to execute 
commands in that mode. An arbitrarily complex box can now be constructed; the fact 
that this box is eventually destined for a \setbox command has no effect on Tj?X’s 
behavior while the box is being built. Eventually, when the matching “}’ appears, 
TjgX restores values that were changed by assignments in the group just ended; then 
it packages the hbox (using the size that was saved on the stack), and completes the 
\setbox command, returning to the mode it was in at the time of the \setbox. 
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Let us now consider other commands that, like assignments, are obeyed in 
basically the same way regardless of Tj?X’s current mode. 

■ \relax. This is an easy one: TjrjX does nothing. 

■ }. This one is harder, because it depends on the current group. T^X should 
now be working on a group that began with {; and it knows why it started that 
group. So it does the appropriate finishing actions, undoes the effects of non-global 
assignments, and leaves the group. At this point T^X might leave its current mode and 
return to a mode that was previously in effect. 

■ \begingroup. When TjrX sees this command, it enters a group that must be 
terminated by \endgroup, not by }. The mode doesn’t change. 

■ \endgroup. TjgX should currently be processing a group that began with 
\begingroup. Quantities that were changed by non-global assignments in that group 
are restored to their former values. TjrX leaves the group, but stays in the same mode. 

■ \show (token), \showbox (8-bit number), \showlists, \showthe (internal 
quantity). These commands are intended to help you figure out what TgX thinks it 
is doing. The tokens following \showthe should be anything that can follow \the, as 
explained in Chapter 20. 

EXERCISE 24.7 

JL JL Review the rules for what can follow \the in Chapter 20, and construct a 
formal syntax that defines (internal quantity) in a way that fits with the other syntax 
rules we have been discussing. 

■ \shipout(box). After the (box) is formed — possibly by constructing it explic- 
itly and changing modes during the construction, as explained for \hbox earlier — its 
contents are sent to the dvi file (see Chapter 23). 

■ \ignorespaces (optional spaces). Tj?X reads (and expands) tokens, doing 
nothing until reaching one that is not a (space token). 

■ \afterassignment(token) . The (token) is saved in a special place; it will be 
inserted back into the input just after the next assignment command has been per- 
formed. An assignment need not follow immediately; if another \afterassignment is 
performed before the next assignment, the second one overrides the first. If the next 
assignment is a \setbox, and if the assigned (box) is \hbox or \vbox or \vtop, the 
(token) will be inserted just after the { in the box construction, not after the }; it will 
also come just before any tokens inserted by \everyhbox or \everyvbox. 

■ \aftergroup(token). The (token) is saved on TgX’s stack; it will be inserted 
back into the input just after the current group has been completed and its local 
assignments have been undone. If several \aftergroup commands occur in the same 
group, the corresponding commands will be scanned in the same order; for example, 
‘{\aftergroup\a\aftergroup\b}’ yields ‘\a\b’. 

■ \uppercase(general text), \lowercase(general text). The (balanced text) in 
the general text is converted to uppercase form or to lowercase form using the \uccode 
or \lccode table, as explained in Chapter 7; no expansion is done. Then Tj^X will read 
that (balanced text) again. 

■ \mes sage (general text), \errmessage(general text). The balanced text (with 
expansion) is written on the user’s terminal, using the format of error messages in the 
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case of \errmessage. In the latter case the \errhelp tokens will be shown if they are 
nonempty and if the user asks for help. 

■ \openin (4-bit number) (equals) (filename), \closein (4-bit number). These 
commands open or close the specified input stream, for use in \read assignments as 
explained in Chapter 20. 

■ \iiranediate\openout(4-bit number) (equals) (filename), \immediate\closeout 
(4-bit number). The specified output stream is opened or closed, for use in Write 
commands, as explained in Chapter 21. 

■ \immediate\write(number) (general text). The balanced text is written on 
the file that corresponds to the specified stream number, provided that such a file 
is open. Otherwise it is written on the user’s terminal and on the log file. (See 
Chapter 21; the terminal is omitted if the (number) is negative.) 

That completes the list of mode-independent commands, i.e., the commands 
that do not directly affect the lists that TpX is building. When T{rX is in vertical mode 
or internal vertical mode, it is constructing a vertical list; when T|X is in horizontal 
mode or restricted horizontal mode, it is constructing a horizontal list; when is 
in math mode or display math mode, it is constructing — guess what — a math list. In 
each of these cases we can speak of the “current list” ; and there are some commands 
that operate in essentially the same way, regardless of the mode, except that they deal 
with different sorts of lists: 

■ \openout(4-bit number) (equals) (filename), \closeout(4-bit number), Write 
(number) (general text). These commands are recorded into a “whatsit” item, which 
is appended to the current list. The command will be performed later, during any 
\shipout that applies to this list, unless the list is part of a box inside leaders. 

■ \special(general text). The balanced text is expanded and put into a “what- 
sit” item, which is appended to the current list. The text will eventually appear in the 
dvi file as an instruction to subsequent software (see Chapter 21). 

■ \penalty (number). A penalty item carrying the specified number is appended 
to the current list. In vertical mode, l{rX also exercises the page builder (see below). 

■ \kern(dimen), \mkern(mudimen). A kern item carrying the specified dimen- 
sion is appended to the current list. In vertical modes this denotes a vertical space; 
otherwise it denotes a horizontal space. An \mkern is allowed only in math modes. 

■ \unpenalty, \unkern, \unskip. If the last item on the current list is respec- 
tively of type penalty, kern, or glue (possibly including leaders), that item is removed 
from the list. However, like \lastbox, these commands are not permitted in vertical 
mode if the main vertical list-so-far has been entirely contributed to the current page, 
since T^^X never removes items from the current page. 

■ \mark(general text). The balanced text is expanded and put into a mark item, 
which is appended to the current list. The text may eventually become the replacement 
text for \topmark, \firstmark, \botmark, \splitf irstmark, and/or \splitbotmark, 
if this mark item ever gets into a vertical list. (Mark items can appear in horizontal 
lists and math lists, but they have no effect until they “migrate” out of their list. The 
migration process is discussed below and in Chapter 25.) 

■ \ insert (8-bit number) (filler) {(vertical mode material)}; the (8-bit number) 
must not be 255. The causes T)jX to enter internal vertical mode and a new 
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level of grouping. When the matching *}’ is sensed, the vertical list is put into an 
insertion item that is appended to the current list using the values of \splittopskip, 
\splitmaxdepth, and \floatingpenalty that were current in the group just ended. 
(See Chapter 15.) This insertion item leads ultimately to a page insertion only if it 
appears in TjrX’s main vertical list, so it will have to “migrate” there if it starts out in 
a horizontal list or a math list. also exercises the page builder (see below), after 
an \ insert has been appended in vertical mode. 

■ \vadjust (filler) {(vertical mode material)}. This is similar to \insert; the 
constructed vertical list goes into an adjustment item that is appended to the current 
list. However, \vadjust is not allowed in vertical modes. When an adjustment item 
migrates from a horizontal list to a vertical list, the vertical list inside the adjustment 
item is “unwrapped” and put directly into the enclosing list. 

* * * 

Almost everything we have discussed so far in this chapter could equally well have 
appeared in a chapter entitled “Summary of Horizontal Mode” or a chapter entitled 
“Summary of Math Mode,” because TjgX treats all of the commands considered so far 
in essentially the same way regardless of the current mode. Chapters 25 and 26 are 
going to be a lot shorter than the present one, since it will be unnecessary to repeat 
all of the mode-independent rules. 

But now we come to commands that are mode-dependent; we shall conclude 
this chapter by discussing what IjgX does with the remaining commands, when in 
vertical mode or internal vertical mode. 

One of the things characteristic of vertical mode is the page-building operation 
described in Chapter 15. TpX periodically takes material that lias been put on the 
main vertical list and moves it from the “contribution list” to the “current page.” At 
such times the output routine might be invoked. We shall say that TpX exercises the 
page builder whenever it tries to empty the current contribution list. The concept of 
contribution list exists only in the outermost vertical mode, so nothing happens when 
TgX exercises the page builder in internal vertical mode. 

Another thing characteristic of vertical modes is the interline glue that is 
inserted before boxes, based on the values of \prevdepth and \baselineskip and 
\lineskip and \lineskiplimit as explained in Chapter 12. If a command changes 
\prevdepth, that fact is specifically mentioned below. The \prevdepth is initially set 
to — 1000 pt, a special value that inhibits interline glue, whenever Tj?X begins to form 
a vertical list, except in the case of \halign and \noalign when the interline glue 
conventions of the outer list continue inside the inner one. 

■ \vskip(glue), \vfil, \vfill, \vss, \vfilneg. A glue item is appended to the 
current vertical list. 

■ (leaders) (box or rule) (vertical skip) . Here (vertical skip) refers to one of the 
five glue-appending commands just mentioned. The formal syntax for (leaders) and for 
(box or rule) is 

(leaders) — > \leaders | \cleaders | \xleaders 

(box or rule) — > (box) | (vertical rule) | (horizontal rule) 

(vertical rule) — \vrule(rule specification) 

(horizontal rule) — > \hrule(rule specification) 
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(rule specification) — ► (optional spaces) | (rule dimension) (rule specification) 
(rule dimension) — > width (dimen) | height (dimen) | depth (dimen) 

A glue item that produces leaders is appended to the current list. 

■ (space token). Spaces have no effect in vertical modes. 

■ (box). The box is constructed, and if the result is void nothing happens. 
Otherwise the current vertical list receives (1) interline glue, followed by (2) the new 
box, followed by (3) vertical material that migrates out of the new box (if the (box) 
was an \hbox command). Then \prevdepth is set to the new box’s depth, and TjgX 
exercises the page builder. 

■ \moveleft (dimen) (box), \moveright (dimen) (box). This acts exactly like an 
ordinary (box) command, but the new box that is appended to the vertical list is also 
shifted left or right by the specified amount. 

■ \unvbox(8-bit number), \unvcopy(8-bit number). If the specified box register 
is void, nothing happens. Otherwise that register must contain a vbox. The vertical 
list inside that box is appended to the current vertical list, without changing it in any- 
way. The value of \prevdepth is not affected. The box register becomes void after 
\unvbox, but it remains unchanged by \unvcopy. 

■ (horizontal rule). The specified rule is appended to the current list. Then 
\prevdepth is set to — 1000 pt; this will prohibit interline glue when the next box is 
appended to the list. 

■ \halign(box specification) {(alignment material)}. The (alignment material) 

consists of a preamble followed by zero or more lines to be aligned; see Chapter 22. 
T)?X enters a new level of grouping, represented by the ‘{’ and '}’, within which changes 
to \tabskip will be confined. The alignment material can also contain optional occur- 
rences of ‘\noalign(filler){(vertical mode material)}’ between lines; this adds another 
level of grouping. operates in internal vertical mode while it works on the ma- 

terial in \noalign groups and when it appends lines of the alignment; the resulting 
internal vertical list will be appended to the enclosing vertical list after the alignment 
is completed, and the page builder will be exercised. The value of \prevdepth at the 
time of the \halign is used at the beginning of the internal vertical list, and the fi- 
nal value of \prevdepth is carried to the enclosing vertical list when the alignment is 
completed, so that the interline glue is calculated properly at the beginning and end of 
the alignment. T}]X also enters an additional level of grouping when it works on each 
individual entry of the alignment, during which time it acts in restricted horizontal 
mode; the individual entries will be hboxed as part of the final alignment, and their 
vertical material will migrate to the enclosing vertical list. The commands \noalign, 
\omit, \span, \cr, \crcr, and ft (where ft denotes an explicit or implicit character of 
category 4) are intercepted by the alignment process, enroute to l{rX’s stomach, so 
they will not appear as commands in the stomach unless T|'X has lost track of what 
alignment they belong to. 

■ \indent. The \parskip glue is appended to the current list, unless TgX is in 
internal vertical mode and the current list is empty. Then T^jX enters unrestricted 
horizontal mode, starting the horizontal list with an empty hbox whose width is 
\parindent. The \everypar tokens are inserted into l{rX’s input. The page builder 
is exercised. When the paragraph is eventually completed, horizontal mode will come 
to an end as described in Chapter 25. 
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■ \noindent. This is exactly like \indent, except that TjjX starts out in hori- 
zontal mode with an empty list instead of with an indentation. 

■ \par. The primitive \par command has no effect when TjrX is in vertical 
mode, except that the page builder is exercised in case something is present on the 
contribution list, and the paragraph shape parameters are cleared. 

■ {. A character token of category 1, or a control sequence like \bgroup that 
has been \let equal to such a character token, causes TjjX to start a new level of 
grouping. When such a group ends — with “}’ — TjrX will undo the effects of non-global 
assignments without leaving whatever mode it is in at that time. 

■ Some commands are incompatible with vertical mode because they are intrin- 
sically horizontal. When the following commands appear in vertical modes they cause 
TjgX to begin a new paragraph: 

(horizontal command) — > (letter) | (otherchar) | \char | (chardef token) 

| \noboundary | \unhbox | \unhcopy | \valign | \vrule 
| \hskip | \hfil | \hfill | \hss | \hfilneg 
| \accent | \discretionary | \- | \ u | $ 

Here (letter) and (otherchar) stand for explicit or implicit character tokens of categories 
11 and 12. If any of these tokens occurs as a command in vertical mode or internal 
vertical mode, TjjX automatically performs an \indent command as explained above. 
This leads into horizontal mode with the \everypar tokens in the input, after which 
T)?X will see the (horizontal command) again. 

■ \end. This command is not allowed in internal vertical mode. In regular 
vertical mode it terminates Tj^X if the main vertical list is empty and \deadcycles=0. 
Otherwise Tj?X backs up the \end command so that it can be read again; then it 
exercises the page builder, after appending a box/glue/penalty combination that will 
force the output routine to act. (See the end of Chapter 23.) 

■ \dump. (Allowed only in INITEX, not in production versions of TjrX.) This 
command is treated exactly like \end, but it must not appear inside a group. It 
outputs a format file that can be loaded into TjjX’s memory at comparatively high 
speed to restore the current status. 

■ None of the above: If any other primitive command of TjjX occurs in vertical 
mode, an error message will be given, and Tj^X will try to recover in a reasonable way. 
For example, if a superscript or subscript symbol appears, or if any other inherently 
mathematical command is given, TjgX will try to insert a “$’ (which will start a para- 
graph and enter math mode). On the other hand if a totally misplaced token like 
\endcsname or \omit or \eqno or # appears in vertical mode, TjgX will simply ignore it, 
after reporting the error. You might enjoy trying to type some really stupid input, just 
to see what happens. (Say ‘\tracingall’ first, as explained in Chapter 27, in order 
to get maximum information.) 

The first and most striking feature is the Verticality of composition, 
as opposed to the Horizontatity of all anterior structural modes. 

— COCKBURN MUIR, Pagan or Christian? (1860) 
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Continuing the survey that was begun in Chapter 24, let us investigate exactly 
what TgX’s digestive processes can do, when TgX is building lists in horizontal 
mode or in restricted horizontal mode. 

* * * 

Three asterisks, just like those that appear here, can be found near the end of Chap- 
ter 24. Everything preceding the three asterisks in that chapter applies to horizontal 
mode as well as to vertical mode, so we need not repeat all those rules. In particular, 
Chapter 24 explains assignment commands, and it tells how kerns, penalties, marks, 
insertions, adjustments, and “wliatsits” are put into horizontal lists. Our present goal 
is to consider the commands that have an intrinsically horizontal flavor, in the sense 
that they behave differently in horizontal mode than they do in vertical or math modes. 

One of the things characteristic of horizontal mode is the “space factor,” which 
modifies the width of spaces as described in Chapter 12. If a command changes the 
value of \spacefactor, that fact is specifically noted here. The space factor is initially 
set to 1000, when TjjX begins to form a horizontal list, except in the case of \valign 
and \noalign when the space factor of the outer list continues inside the inner one. 

■ \hskip{glue), \hfil, \hfill, \hss, \hfilneg. A glue item is appended to the 
current horizontal list. 

■ (leaders) (box or rule) (horizontal skip). Here (horizontal skip) refers to one of 
the five glue-appending commands just mentioned; the formal syntax for (leaders) and 
for (box or rule) is given in Chapter 24. A glue item that produces leaders is appended. 

■ (space token). Spaces append glue to the current list; the exact amount of 
glue depends on \spacef actor, the current font, and the \spaceskip and \xspaceskip 
parameters, as described in Chapter 12. 

■ \ u . A control-space command appends glue to the current list, using the same 
amount that a (space token) inserts when the space factor is 1000. 

■ (box). The box is constructed, and if the result is void nothing happens. Oth- 
erwise the new box is appended to the current list, and the space factor is set to 1000. 

■ \raise(dimen)(box), Mower (dimen) (box). This acts exactly like an ordinary 
(box) command, but the new box that is appended to the horizontal list is also shifted 
up or down by the specified amount. 

■ \unhbox(8-bit number), \unhcopy(8-bit number). If the specified box register 
is void, nothing happens. Otherwise that register must contain an hbox. The horizontal 
list inside that box is appended to the current horizontal list, without changing it in 
any way. The value of \spacefactor is not affected. The box register becomes void 
after \unhbox, but it remains unchanged by \unhcopy. 

■ (vertical rule). The specified rule is appended to the current list, and the 
\spacefactor is set to 1000. 

■ \valign(box specification) {(alignment material)}. The (alignment material) 
consists of a preamble followed by zero or more columns to be aligned; see Chap- 
ter 22. IjjX enters a new level of grouping, represented by the and “}’, within which 
changes to \tabskip will be confined. The alignment material can also contain op- 
tional occurrences of ‘\noalign(filler){(liorizontal mode material)}’ between columns; 
this adds another level of grouping. lj?X operates in restricted horizontal mode while 
it works on the material in \noalign groups and when it appends columns of the 
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alignment; the resulting internal horizontal list will be appended to the enclosing hor- 
izontal list after the alignment is completed. The value of \spacefactor at the time 
of the \valign is used at the beginning of the internal horizontal list, and the final 
value of \spacefactor is carried to the enclosing horizontal list when the alignment 
is completed. The space factor is set to 1000 after each column; hence it affects the 
results only in \noalign groups. TpX also enters an additional level of grouping when 
it works on each individual entry of the alignment, during which time it acts in internal 
vertical mode; the individual entries will be vboxed as part of the final alignment. 

■ \indent. An empty box of width \parindent is appended to the current list, 
and the space factor is set to 1000. 

■ \no indent. This command has no effect in horizontal modes. 

■ \par. The primitive \par command, also called \endgraf in plain TjjX, does 
nothing in restricted horizontal mode. But it terminates horizontal mode: The current 
list is finished off by doing \unskip XpenaltylOOOO \hskip\parf illskip, then it is 
broken into lines as explained in Chapter 14, and IjgX returns to the enclosing vertical or 
internal vertical mode. The lines of the paragraph are appended to the enclosing vertical 
list, interspersed with interline glue and interline penalties, and with the migration of 
vertical material that was in the horizontal list. Then TjrX exercises the page builder. 

■ {. A character token of category 1, or a control sequence like \bgroup that 
has been \let equal to such a character token, causes IjgX to start a new level of 
grouping. When such a group ends — with *}’ — TjgX will undo the effects of non-global 
assignments without leaving whatever mode it is in at that time. 

■ Some commands are incompatible with horizontal mode because they are in- 
trinsically vertical. When the following commands appear in unrestricted horizontal 
mode, they cause Tj?X to conclude the current paragraph: 

(vertical command) — > \unvbox | \unvcopy | \halign | \hrule 
| \vskip | \vfil | \vfill | \vss | \vfilneg | \end | \dump 

The appearance of a (vertical command) in restricted horizontal mode is forbidden, but 
in regular horizontal mode it causes TpX to insert the token |par | into the input; after 
reading and expanding this |par | token, Tj?X will see the (vertical command) token 
again. (The current meaning of the control sequence \par will be used; |par | might 
no longer stand for TjLX’s \par primitive. ) 

■ (letter), (otherchar), \char (8-bit number), (chardef token), \noboundary. The 
most common commands of all are the character commands that tell Tj^X to append a 
character to the current horizontal list, using the current font. If two or more commands 
of this type occur in succession, TpX processes them all as a unit, converting to ligatures 
and/or inserting kerns as directed by the font information. (Ligatures and kerns may be 
influenced by invisible “boundary” characters at the left and right, unless \noboundary 
appears.) Each character command adjusts \spacefactor, using the \sfcode table as 
described in Chapter 12. In unrestricted horizontal mode, a ‘\discretionary{}{}{}’ 
item is appended after a character whose code is the \hyphenchar of its font, or after 
a ligature formed from a sequence that ends with such a character. 

■ \accent(8-bit number) (optional assignments). Here (optional assignments) 
stands for zero or more (assignment) commands. If the assignments are not followed 
by a (character), where (character) stands for any of the commands just discussed in 
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the previous paragraph, IjjX treats \accent as if it were \char, except that the space 
factor is set to 1000. Otherwise the character that follows the assignment is accented by 
the character that corresponds to the (8-bit number). (The purpose of the intervening 
assignments is to allow the accenter and accentee to be in different fonts. ) If the accent 
must be moved up or down, it is put into an hbox that is raised or lowered. Then the 
accent is effectively superposed on the character by means of kerns, in such a way that 
the width of the accent does not influence the width of the resulting horizontal list. 
Finally, T^X sets \spacef actor=1000. 

■ \/. If the last item on the current list is a character or ligature, an explicit 
kern for its italic correction is appended. 

■ \discretionary(general text)(general text) (general text). The three general 
texts are processed in restricted horizontal mode. They should contain only fixed-width 
things; hence they aren’t really very general in this case. More precisely, the horizontal 
list formed by each discretionary general text must consist only of characters, ligatures, 
kerns, boxes, and rules; there should be no glue or penalty items, etc. This command 
appends a discretionary item to the current list; see Chapter 14 for the meaning of a 
discretionary item. The space factor is not changed. 

■ \-. This “discretionary hyphen” command is defined in Appendix H. 

■ \setlanguage(number). See the conclusion of Appendix H. 

■ $. A “math shift” character causes TjjX to enter math mode or display math 
mode in the following way: T^]X looks at the following token without expanding it. If 
that token is a $ and if TjzX is currently in unrestricted horizontal mode, then TjzX 
breaks the current paragraph into lines as explained above (unless the current list is 
empty), returns to the enclosing vertical mode or internal vertical mode, calculates 
values like \prevgraf and \displaywidth and \predisplaysize, enters a new level of 
grouping, inserts the \everydisplay tokens into the input, exercises the page builder, 
processes ‘(math mode material) $$’ in display math mode, puts the display into the 
enclosing vertical list as explained in Chapter 19 (letting vertical material migrate), 
exercises the page builder again, increases \prevgraf by 3, and resumes horizontal 
mode again, with an empty list and with the space factor equal to 1000. (You got 
that?) Otherwise TjjX puts the looked-at token back into the input, enters a new level 
of grouping, inserts the \everymath tokens, and processes ‘(math mode material)$’; the 
math mode material is converted to a horizontal list and appended to the current list, 
surrounded by “math-on” and “math-off” items, and the space factor is set to 1000. 
One consequence of these rules is that ‘$$’ in restricted horizontal mode simply yields 
an empty math formula. 

■ None of the above: If any other primitive command of IjgX occurs in horizontal 
mode, an error message will be given, and IjjX will try to recover in a reasonable way. 
For example, if a superscript or subscript symbol appears, or if any other inherently 
mathematical command is given, IjjX will try to insert a ‘$’ just before the offending 
token; this will enter math mode. 

Otherwise. You may reduce all Verticals into Horizontals. 

— JOSEPH MOXON, A Tutor to Astronomie and Geographie (1659) 
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! You can’t use ‘\moveleft’ in horizontal mode. 

— T e X (1982) 
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To conclude the survey that was begun in Chapter 24, let us investigate exactly 
what Tj^X’s digestive processes can do when Tf^X is building lists in math mode 
or in display math mode. 


* * * 

Three asterisks, just like those that appear here, can be found near the end of Chap- 
ter 24. Everything preceding the three asterisks in that chapter applies to math mode as 
well as to vertical mode, so we need not repeat all those rules. In particular, Chapter 24 
explains assignment commands, and it tells how kerns, penalties, marks, insertions, ad- 
justments, and “whatsits” are put into math lists. Our present goal is to consider the 
commands that have an intrinsically mathematical flavor, in the sense that they behave 
differently in math mode than they do in vertical or horizontal modes. 

Math lists are somewhat different from TjrX’s other lists because they contain 
tliree-pronged “atoms” (see Chapter 17). Atoms come in thirteen flavors: Ord, Op, 
Bin, Rel, Open, Close, Punch, Inner, Over, Under, Acc, Rad, and Vcent. Each atom 
contains three “fields” called its nucleus, superscript, and subscript; and each field is 
either empty or is filled with a math symbol, a box, or a subsidiary math list. Math 
symbols, in turn, have two components: a family number and a position number. 

It’s convenient to introduce a few more rules of syntax, in order to specify 
what goes into a math list: 

(character) — > (letter) | (otherchar) | \ char (8-bit number) | (chardef token) 
(math character) — > \mathchar(15-bit number) | (mathchardef token) 

| \delimiter(27-bit number) 

(math symbol) — > (character) | (math character) 

(math field) — > (math symbol) | (filler) {(math mode material)} 

(delim) — > (filler) \delimiter(27-bit number) 

| (filler) (letter) | (filler) (otherchar) 

We have already seen the concept of (character) in Chapter 25. Indeed, characters 
are Tf.'X 's staple food: The vast majority of all commands that reach TjrX’s digestive 
processes in horizontal mode are instances of the (character) command, which specifies 
a number between 0 and 255 that causes TjXX to typeset the corresponding character 
in the current font. When IjgX is in math mode or display math mode, a (character) 
command takes on added significance: It specifies a number between 0 and 32767 = 
2 la — 1. This is done by replacing the character number by its \mathcode value. If the 
\mathcode value turns out to be 32768 = "8000, however, the (character) is replaced 
by an active character token having the original character code (0 to 255); TyX forgets 
the original (character) and expands this active character according to the rules of 
Chapter 20. 

A (math character) defines a 15-bit number either by specifying it directly 
with \mathchar or in a previous \mathchardef , or by specifying a 27-bit \delimiter 
value; in the latter case, the least significant 12 bits are discarded. 

It follows that every (math symbol), as defined by the syntax above, specifies 
a 15-bit number, i.e. , a number between 0 and 32767. Such a number can be repre- 
sented in the form 4096c -f 256/ + a, where 0 < c < 8, 0 < / < 16, and 0 < a < 256. 
If c = 7, IjjX changes c to 0; and in this case if the current value of \fam is between 
0 and 15, TjjX also replaces / by \fam. This procedure yields, in all cases, a class 
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number c between 0 and 6, a family number / between 0 and 15, and a position num- 
ber a between 0 and 255. (Tj/X initializes the value of \fam by implicitly putting the 
assignment ‘\fam=-l’ at the very beginning of \everymath and \everydisplay. Thus, 
the substitution of \fam for / will occur only if the user lias explicitly changed \fam 
within the formula.) 

A (math field) is used to specify the nucleus, superscript, or subscript of an 
atom. When a (math field) is a (math symbol), the / and a numbers of that sym- 
bol go into the atomic field. Otherwise the (math field) begins with a which 
causes TfjX to enter a new level of grouping and to begin a new math list; the en- 
suing (math mode material) is terminated by a at which point the group ends and 
the resulting math list goes into the atomic field. If the math list turns out to be simply 
a single Ord atom without subscripts or superscripts, or an Acc. whose nucleus is an 
Ord, the enclosing braces are effectively removed. 

A (delim) is used to define both a “small character” a in family / and a “large 
character” b in family g, where 0 < a, b < 255 and 0 < /, g < 15; these character 
codes are used to construct variable-size delimiters, as explained in Appendix G. If the 
(delim) is given explicitly in terms of a 27-bit number, the desired codes are obtained 
by interpreting that number as c • 2 24 + / • 2 20 + a ■ 2 12 + g ■ 2 s + b, ignoring the value 
of c. Otherwise the delimiter is specified as a (letter) or (otherchar) token, and the 
24-bit \delcode value of that character is interpreted as / • 2 20 + a ■ 2 12 + g ■ 2 s + b. 

Now let's study the individual commands as TjjX obeys them in math mode, 
considering first the ones that have analogs in vertical and/or horizontal mode: 

■ \hskip(glue), \hfil, \hfill, \hss, \hfilneg, \mskip(muglue). A glue item 
is appended to the current math list. 

■ (leaders) (box or rule) (mathematical skip). Here (mathematical skip) refers to 
one of the six glue-appending commands just mentioned; the formal syntax for (leaders) 
and for (box or rule) is given in Chapter 24. A glue item that produces leaders is 
appended to the current list. 

■ \nonscript. A special glue item of width zero is appended; it will have 
the effect of cancelling the following item on the list, if that item is glue and if the 
\nonscript is eventually typeset in “script style” or in “scriptscript style.” 

■ \noboundary. This command is redundant and therefore has no effect; bound- 
ary ligatures are automatically disabled in math modes. 

■ (space token). Spaces have no effect in math modes. 

■ \ u . A control-space command appends glue to the current list, using the same 
amount that a (space token) inserts in horizontal mode when the space factor is 1000. 

■ (box). The box is constructed, and if the result is void nothing happens. 
Otherwise a new Ord atom is appended to the current math list, and the box becomes 
its nucleus. 

■ \raise(dimen)(box), Mower (dimen) (box). This acts exactly like an ordinary 
(box) command, but the new box that is put into the nucleus is also shifted up or down 
by the specified amount. 

■ \vcenter(box specification) {(vertical mode material)}. A vbox is formed as 
if ‘\vcenter’ had been ‘\vbox’. Then a new Vcent atom is appended to the current 
math list, and the box becomes its nucleus. 
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■ (vertical rule). A rule is appended to the current list (not as an atom). 

■ \halign(box specification) {{alignment material)}. This command is allowed 
only in display math mode, and only when the current math list is empty. The align- 
ment is carried out exactly as if it were done in the enclosing vertical mode (see Chap- 
ter 24), except that the lines are shifted right by the \displayindent. The closing 
“}’ may be followed by optional (assignment) commands, after which “$$’ must con- 
clude the display. TfrX will insert the \abovedisplayskip and \belowdisplayskip glue 
before and after the result of the alignment. 

■ \indent. An empty box of width \parindent is appended to the current list, 
as the nucleus of a new Ord atom. 

■ \no indent. This command has no effect in math modes. 

■ {{math mode material)}. A character token of category 1, or a control se- 
quence like \bgroup that has been \let equal to such a character token, causes TjrX 
to start a new level of grouping and also to begin work on a new math list. When such 
a group ends — with “}’ — TjjX uses the resulting math list as the nucleus of a new Ord 
atom that is appended to the current list. If the resulting math list is a single Acc 
atom, however (i.e., an accented quantity), that atom itself is appended. 

■ (math symbol). (This is the most common command in math mode; see the 
syntax near the beginning of this chapter.) A math symbol determines three values, 
c, /, and a, as explained earlier. TjjX appends an atom to the current list, where the 
atom is of type Ord, Op, Bin, Rel, Open, Close, or Punct, according as the value of c is 
0, 1, 2, 3, 4, 5, or 6. The nucleus of this atom is the math symbol defined by / and a. 

■ (math atom) (math field). A (math atom) command is any of the following: 

\mathord | \mathop | \mathbin | \mathrel | \mathopen 

| \mathclose | \mathpunct | \mathinner | \underline | \overline 

TjgX processes the (math field), then appends a new atom of the specified type to the 
current list; the nucleus of this atom contains the specified field. 

■ \mathaccent(15-bit number)(math field). TjjX converts the (15-bit number) 
into c, /, and a as it does with any \mathchar. Then it processes the (math field) and 
appends a new Acc atom to the current list. The nucleus of this atom contains the 
specified field; the accent character in this atom contains (a,/). 

■ \radical(27-bit number)(math field). TjjX converts the (27-bit number) into 
a, /, b, and g as it does with any \delimiter. Then it processes the (math field) and 
appends a new Rad atom to the current list. The nucleus of this atom contains the 
specified field; the delimiter field in this atom contains (a,/) and (6, g). 

■ (superscript) (math field). A (superscript) command is an explicit or implicit 
character token of category 7. If the current list does not end with an atom, a new Ord 
atom with all fields empty is appended; thus the current list will end with an atom, 
in all cases. The superscript field of this atom should be empty; it is made nonempty 
by changing it to the result of the specified (math field). 

■ (subscript) (math field). A (subscript) command is an explicit or implicit char- 
acter token of category 8. It acts just like a (superscript) command, except, of course, 
that it affects the subscript field instead of the superscript field. 
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■ \displaylimits, \limits, \nolimits. These commands are allowed only if 
the current list ends with an Op atom. They modify a special field in that Op atom, 
specifying what conventions should be used with respect to limits. The normal value 
of that field is \displaylimits. 

■ \/. A kern of width zero is appended to the current list. (This will have the 
effect of adding the italic correction to the previous character, if the italic correction 
wouldn’t normally have been added.) 

■ \discretionary(general text)(general text) (general text). This command is 
treated just as in horizontal mode (see Chapter 25), but the third (general text) must 
produce an empty list. 

■ \-. This command is usually equivalent to l \discretionary{-}{}{}’; the 
is therefore interpreted as a hyphen, not as a minus sign. (See Appendix H.) 

■ \mathchoice(general text) (general text) (general text) (general text). The four 
general texts are each treated as subformulas (i.e., like the second alternative in the 
definition of (math field)). The four math lists defined in this way are recorded in a 
“choice item” that is appended to the current list. 

■ \displaystyle, \textstyle, \scriptstyle, \scriptscriptstyle. A style- 
change item that corresponds to the specified style is appended to the current list. 

■ \left(delim)(math mode material) Xright (delim). IjrX begins a new group, 
and processes the (math mode material) by starting out with a new math list that 
begins with a left boundary item containing the first delimiter. This group must be 
terminated by Aright’, at which time the internal math list is completed with a right 
boundary item containing the second delimiter. Then TjgX appends an Inner atom to 
the current list; the nucleus of this atom contains the internal math list. 

■ (generalized fraction command). This command takes one of six forms: 

\over | \atop | \above(dimen) 

| \overwithdelims (delim) (delim) 

| \atopwithdelims (delim) (delim) 

| \abovewithdelims(delim) (delim) (dimen) 

(See Chapter 17.) When TjrX sees a (generalized fraction command) it takes the entire 
current list and puts it into the numerator field of a generalized fraction item. The 
denominator field of this new item is temporarily empty; the left and right delimiter 
fields are set equal to the specified delimiter codes. IjgX saves this generalized fraction 
item in a special place associated with the current level of math mode processing. 
(There should be no other generalized fraction item in that special place, because 
constructions like ‘a\over b\over c’ are illegal.) Then TjrX makes the current list 
empty and continues to process commands in math mode. Later on, when the current 
level of math mode is completed (either by coming to a *$’ or a or a \right, 
depending on the nature of the current group), the current list will be moved into the 
denominator field of the generalized fraction item that was saved; then that item, all by 
itself, will take the place of the entire list. However, in the special case that the current 
list began with \left and will end with \right, the boundary items will be extracted 
from the numerator and denominator of the generalized fraction, and the final list will 
consist of three items: left boundary, generalized fraction, right boundary. (If you 
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want to watch the process by which math lists are built, you might find it helpful to 
type ‘\showlists 1 while TpX is processing the denominator of a generalized fraction.) 

■ (eqno) (math mode material)!. Here (eqno) stands for either \eqno or \leqno; 
these commands are allowed only in display math mode. Upon reading (eqno), TjrX 
enters a new level of grouping, inserts the \everymath tokens, and enters non-display 
math mode to put the (math mode material) into a math list. When that math list is 
completed, IjrX converts it to a horizontal list and puts the result into a box that will 
be used as the equation number of the current display. The closing $ token will be put 
back into the input, where it will terminate the display. 

■ $. If T|-'X is in display math mode, it reads one more token, which must also 
be $. In either case, the math-shift command terminates the current level of math 
mode processing and ends the current group, which should have begun with either $ 
or (eqno). Once the math list is finished, it is converted into a horizontal list as 
explained in Appendix G. 

■ None of the above: If any other primitive command of IjrX occurs in math 
mode, an error message will be given, and TjjX will try to recover in a reasonable way. 
For example, if a \par command appears, or if any other inherently non-mathematical 
command is given, TjgX will try to insert a '$’ just before the offending token; this will 
lead out of math mode. On the other hand if a totally misplaced token like \endcsname 
or \omit or # appears in math mode, will simply ignore it, after reporting the error. 
You might enjoy trying to type some really stupid input, just to see what happens. (Say 
‘\tracingall’ first, as explained in Chapter 27, in order to get maximum information.) 

(£)(£)*■ EXERCISE 26.1 

JL JL Powers of ten: The whole IjjX language has now been summarized completely. 
To demonstrate how much you know, name all of the ways you can think of in which 
the numbers 10, 100, 1000, 10000, and 100000 have special significance to TjfjX. 



► EXERCISE 26.2 

Powers of two: Name all of the ways you can think of in which the numbers 


8, 16, 32, 64, 128, 256, . . . have special significance to TjgX. 
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Mathematics is known in the trade as difficult, or penalty, copy 
because it is slower, more difficult, and more expensive to set in type 
than any other kind of copy normally occurring in books and journals. 
— UNIVERSITY OF CHICAGO PRESS, A Manual of Style (1969) 

The tale of Math is a complex one, 
and it resists both a simple plot summary 
and a concise statement of its meaning. 
— PATRICK K. FORD, The Mabinogi (1977) 
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OK, everything you need to know about T^X has been explained — unless you 
happen to be fallible. If you don’t plan to make any errors, don’t bother to read 
this chapter. Otherwise you might find it helpful to make use of some of the 
ways that TgX tries to pinpoint bugs in your manuscript. 

In the trial runs you did when reading Chapter 6, you learned the general 
form of error messages, and you also learned the various ways in which you can 
respond to TgX’s complaints. With practice, you will be able to correct most 
errors “online,” as soon as T^X has detected them, by inserting and deleting a 
few things. The right way to go about this is to be in a mellow mood when 
you approach T^X, and to regard the error messages that you get as amusing 
puzzles — “Why did the machine do that?” — rather than as personal insults. 

TgX knows how to issue more than a hundred different sorts of error 
messages, and you probably never will encounter all of them, because some 
types of mistakes are very hard to make. We discussed the “undefined control 
sequence” error in Chapter 6; let’s take a look at a few of the others now. 

If you misspell the name of some unit of measure — for example, if you 
type ‘\hsize=4im’ instead of ‘\hsize=4in’ — you’ll get an error message that 
looks something like this: 

! Illegal unit of measure (pt inserted) . 

<to be read again> 

i 

<to be read again> 

m 

<*> \hsize=4im 

\ input story 

TgX needs to see a legal unit before it can proceed; so in this case it has implicitly 
inserted ‘pt’ at the current place in the input, and it has set \hsize=4pt. 

What’s the best way to recover from such an error? Well, you should 
always type ‘H’ or ‘h’ to see the help message, if you aren’t sure what the error 
message means. Then you can look at the lines of context and see that TgX 
will read ‘ i ’ and then ‘m’ and then 1 \ input story ’ if you simply hit (return) 
and carry on. Unfortunately, this easy solution isn’t very good, because the ‘i’ 
and ‘m’ will be typeset as part of the text of a new paragraph. A much more 
graceful recovery is possible in this case, by first typing ‘2’. This tells TgX to 
discard the next two tokens that it reads; and after T^X has done so, it will stop 
again in order to give you a chance to look over the new situation. Here is what 
you will see: 

<recently read> m 

<*> \hsize=4im 


error messages 
Illegal unit 
recover 


? 


\ input story 
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Good; the ‘i’ and ‘m’ are read and gone. But if you hit (return) now, TgX will 
‘\input story’ and try to typeset the story.tex file with \hsize=4pt; that 
won’t be an especially exciting experiment, because it will simply produce dozens 
of overfull boxes, one for every syllable of the story. Once again there’s a better 
way: You can insert the command that you had originally intended, by typing 

I\hsize=4in 


now. This instructs TgX to change \hsize to the correct value, after which it 
will \ input story and you’ll be on your way. 

► EXERCISE 27.1 

Ben User typed ‘8’, not ‘2’, in response to the error message just considered; his 
idea was to delete ‘ i ’, ‘m’, ‘\ input’, and the five letters of ‘story’. But Tf^X’s 
response was 

<*> \hsize=4im \input stor 

y 

Explain what happened. 


TjrjX usually tries to recover from errors either by ignoring a command 
that it doesn’t understand, or by inserting something that will keep it happy. For 
example, we saw in Chapter 6 that Tj^X ignores an undefined control sequence; 
and we just observed that T^X inserts ‘pt’ when it needs a physical unit of 
measure. Here’s another example where TgX puts something in: 

! Missing $ inserted. 

<inserted text> 


User 

help message 
Missing 


<to be read again> 

1.11 the fact that 32768=2~ 

{15} wasn’t interesting 

? H 

I’ve inserted a begin-math/end-math symbol since I think 
you left one out. Proceed, with fingers crossed. 

(The user has forgotten to enclose a formula in $ signs, and TgX has tried to 
recover by inserting one.) In this case the (inserted text) is explicitly shown, 
and it has not yet been read; by contrast, our previous example illustrated a 
case where TpX had already internalized the ‘pt’ that it had inserted. Thus the 
user has a chance here to remove the inserted before TgX really sees it. 

What should be done? The error in this example occurred before TgjX 
noticed anything wrong; the characters ‘32768=2’ have already been typeset in 
horizontal mode. There’s no way to go back and cancel the past, so the lack of 
proper spacing around the ‘=’ cannot be fixed. Our goal of error recovery in this 
case is therefore not to produce perfect output; we want rather to proceed in some 
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way so that TgX will pass by the present error and detect subsequent ones. If we 
were simply to hit (return) now, our aim would not be achieved, because TgX 
would typeset the ensuing text as a math formula: a5 wasn'tinteresting . . .’ ; 
another error would be detected when the paragraph is found to end before any 
closing has appeared. On the other hand, there’s a more elaborate way to 
recover, namely to type ‘6’ and then T$~{15}$’; this deletes ‘$~{15}’ and inserts 
a correct partial formula. But that’s more complicated than necessary. The best 
solution in this case is to type just ‘2’ and then go on; TgX will typeset the 
incorrect equation ‘32768=215’, but the important thing is that you will be able 
to check out the rest of the document as if this error hadn’t occurred. 



► EXERCISE 27.2 

Here’s a case in which a backslash was inadvertently omitted: 
! Missing control sequence inserted. 

Cinserted text> 

\inaccessible 
<to be read again> 

m 

1.10 \def m 

acrolreplacement} 


TgX needs to see a control sequence after ‘\def ’, so it has inserted one that will allow 
the processing to continue. (This control sequence is shown as ‘\inaccessible’, but 
it has no relation to any control sequence that you can actually specify in an error-free 
manuscript.) If you simply hit (return) at this point, TjrX will define the inaccessible 
control sequence, but that won’t do you much good; later references to \macro will be 
undefined. Explain how to recover from this error so that the effect will be the same 
as if line 10 of the input file had said ‘\def\macro{replacement}’. 

/ ►EXERCISE 27.3 

JL When you use the ‘I’ option to respond to an error message, the rules of 
Chapter 8 imply that TjrX removes all spaces from the right-hand end of the line. 
Explain how you can use the ‘I’ option to insert a space, in spite of this fact. 


Some of the toughest errors to deal with are those in which you make 
a mistake on line 20 (say), but TpjX cannot tell that anything is amiss until it 
reaches line 25 or so. For example, if you forget a ‘}’ that completes the argu- 
ment to some macro, T^X won’t notice any problem until reaching the end of 
the next paragraph. In such cases you probably have lost the whole paragraph; 
but TpjX will usually be able to get straightened out in time to do the subse- 
quent paragraphs as if nothing had happened. A “runaway argument” will be 
displayed, and by looking at the beginning of that text you should be able to 
figure out where the missing ! }’ belongs. 

It’s wise to remember that the first error in your document may well 
spawn spurious “errors” later on, because anomalous commands can inflict seri- 
ous injury on TgX’s ability to cope with the subsequent material. But most of 
the time you will find that a single run through the machine will locate all of 
the places in which your input conflicts with TgX’s rules. 
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When your error is due to misunderstanding rather than mistyping, the 
situation is even more serious: TeX’s error messages will probably not be very 
helpful, even if you ask TgX for help. If you have unknowingly redefined an 
important control sequence — for example, if you have said ‘\def \box{ . . . }’ — all 
sorts of strange disasters might occur. Computers aren’t clairvoyant, and TeX 
can only explain what looks wrong from its own viewpoint; such an explanation 
is bound to be mysterious unless you can understand the machine’s attitude. 
The solution to this problem is, of course, to seek human counsel and advice; or, 
as a last resort, to read the instructions in Chapters 2, 3, . . . , 26. 



► EXERCISE 27.4 

J. H. Quick (a student) once defined the following set of macros: 

\newcount\serialnumber 
\def\f irstnumber{\serialnumber=0 } 
\def\nextnumber{\advance \serialnumber by 1 
\number\serialnumber) \nobreak\hskip . 2em } 


Thus he could type, for example, 

\f irstnumber 

\nextnumber xx , \nextnumber yy , and \nextnumber zz 

and T^X would typeset T) xx, 2) yy, and 3) zz’. Well, this worked fine, and he showed 
the macros to his buddies. But several months later he received a frantic phone call; 
one of his friends had just encountered a really weird error message: 

! Missing number, treated as zero. 

<to be read again> 

c 

1.107 \nextnumber minusc 

ule chances of error 

? 


Quick 
weird error 


Explain what happened, and advise Quick what to do. 

Sooner or later — hopefully sooner — you’ll get TeX to process your whole 
file without stopping once to complain. But maybe the output still won’t be right; 
the mere fact that TeX didn’t stop doesn’t mean that you can avoid proofreading. 
At this stage it’s usually easy to see how to fix typographic errors by correcting 
the input. Errors of layout can be overcome by using methods we have discussed 
before: Overfull boxes can be cured as described in Chapter 6; bad breaks can 
be avoided by using ties or \hbox commands as discussed in Chapter 14; math 
formulas can be improved by applying the principles of Chapters 16-19. 

But your output may contain seemingly inexplicable errors. For exam- 
ple, if you have specified a font at some magnification that is not supported by 
your printing software, TeX will not know that there is any problem, but the 
program that converts your dvi file to hardcopy might not tell you that it has 
substituted an “approximate” font for the real one; the resultant spacing may 
look quite horrible. 
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If you can’t find out what went wrong, try the old trick of simplifying 
your program: Remove all the things that do work, until you obtain the shortest 
possible input file that fails in the same way as the original. The shorter the file, 
the easier it will be for you or somebody else to pinpoint the problem. 

Perhaps you’ll wonder why TpX didn’t put a blank space in some posi- 
tion where you think you typed a space. Remember that TpX ignores spaces that 
follow control words, when it reads your file. (TpX also ignores a space after a 
(number) or a (unit of measure) that appears as an argument to a primitive com- 
mand; but if you are using properly designed macros, such rules will not concern 
you, because you will probably not be using primitive commands directly.) 

/gb/gb On the other hand, if you are designing macros, the task of troubleshooting 
JL JL can be a lot more complicated. For example, you may discover that TRX lias 
emitted three blank spaces when it processed some long sequence of complicated code, 
consisting of several dozen commands. How can you find out where those spaces crept 
in? The answer is to set ‘\tracingcoiranands=l’, as mentioned in Chapter 13. This 
tells TgX to put an entry in your log file whenever it begins to execute a primitive 
command; you’ll be able to see when the command is ‘blank space’. 


f Most implementations of TRX allow you to interrupt the program in some way. 

This makes it possible to diagnose the causes of infinite loops. TgX switches to 
\errorstopmode when interrupted; lienee you have a chance to insert commands into 
the input: You can abort the run, or you can \show or change the current contents of 
control sequences, registers, etc. You can also get a feeling for where TRX is spending 
most of its time, if you happen to be using an inefficient macro, since random interrupts 
will tend to occur in whatever place TRX visits most often. 


f Sometimes an error is so bad that TjjX is forced to quit prematurely. For 
example, if you are running in \batchmode or \nonstopmode, TRX makes an 
“emergency stop” if it needs input from the terminal; this happens when a necessary 
file cannot be opened, or when no \end command was found in the input document. 
Here are some of the messages you might get just before TRX gives up the ghost: 


Fatal format file error; I’m stymied. 

This means that the preloaded format you have specified cannot be used, because it 
was prepared for a different version of TRX. 


That makes 100 errors; please try again. 

TRX has scrolled past 100 errors since the last paragraph ended, so it’s probably in 
an endless loop. 

Interwoven alignment preambles are not allowed. 

If you have been so devious as to get this message, you will understand it, and you will 
deserve no sympathy. 


I can’t go on meeting you like this. 

A previous error has gotten TRX out of whack. Fix it and try again. 

This can’t happen. 

Something is wrong with the TjjX you are using. Complain fiercely. 
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There’s also a dreadful message that TgX issues only with great reluctance. 
But it can happen: 

TeX capacity exceeded, sorry. 

This, alas, means that you have tried to stretch IgX too far. The message will tell 
you what part of TjgX’s memory has become overloaded; one of the following fourteen 
things will be mentioned: 

number of strings (names of control sequences and files) 
pool size (the characters in such names) 

main memory size (boxes, glue, breakpoints, token lists, characters, etc.) 

hash size (control sequence names) 

font memory (font metric data) 

exception dictionary (hyphenation exceptions) 

input stack size (simultaneous input sources) 

semantic nest size (unfinished lists being constructed) 

parameter stack size (macro parameters) 

buffer size (characters in lines being read from files) 

save size (values to restore at group ends) 

text input levels (\input files and error insertions) 

grouping levels (unfinished groups) 

pattern memory (hyphenation pattern data) 

The current amount of memory available will also be shown. 

If you have a job that doesn’t overflow TjrX’s capacity, yet you want to see 
just how closely you have approached the limits, just set \tracingstats to 
a positive value before the end of your job. The log file will then conclude with a 
report on your actual usage of the first eleven things named above (i.e., the number of 
strings, . . . , the save size), in that order. Furthermore, if you set \tracingstats equal 
to 2 or more, TjgX will show its current memory usage whenever it does a \shipout 
command. Such statistics are broken into two parts: ‘ 490 & 5950 ’ means, for example, 
that 490 words are being used for “large” things like boxes, glue, and breakpoints, 
while 5950 words are being used for “small” things like tokens and characters. 

What can be done if TpX’s capacity is exceeded? All of the above-listed 
components of the capacity can be increased, provided that your computer 
is large enough; in fact, the space necessary to increase one component can usually 
be obtained by decreasing some other component, without increasing the total size of 
TjrX. If you have an especially important application, you may be able to convince 
your local system people to provide you with a special whose capacities have been 
hand-tailored to your needs. But before taking such a drastic step, be sure that you are 
using TjgX properly. If you have specified a gigantic paragraph or a gigantic alignment 
that spans more than one page, you should change your approach, because IjjX has to 
read all the way to the end before it can complete the line-breaking or the alignment 
calculations; this consumes huge amounts of memory space. If you have built up an 
enormous macro library, you should remember that TjjX has to remember all of the 
replacement texts that you define; therefore if memory space is in short supply, you 
should load only the macros that you need. (See Appendices B and D, for ideas on 
how to make macros more compact.) 





TeX capacity exceeded 
tracingstats 
stack positions 
shipout 
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Some erroneous TgX programs will overflow any finite memory capacity. For 
example, after ‘\def\recurse{ (\recurse)}’, the use of \recurse will imme- 


diately bomb out: 


! TeX capacity exceeded, sorry [input stack size=80] . 
\recurse ->(\recurse 

) 

\recurse ->(\recurse 

) 


recursion 

save size 

grouping 

save stack 

global assignments 

tracingrestores 

right brace 

tracinglostchars 

tracingoutput 


The same sort of error will obviously occur no matter how much you increase T^rX’s 
input stack size. 

The special case of “save size” capacity exceeded is one of the most trou- 
JL JL blesome errors to correct, especially if you run into the error only on long 
jobs. TgX generally uses up two words of save size whenever it performs a non-global 
assignment to some quantity whose previous value was not assigned at the same level 
of grouping. When macros are written properly, there will rarely be a need for more 
than 100 or so things on the “save stack”; but it’s possible to make save stack usage 
grow without limit if you make both local and global assignments to the same variable. 
You can figure out what TjgX puts on the save stack by setting \tracingrestores=l; 
then your log file will record information about whatever is removed from the stack at 
the end of a group. For example, let \a stand for the command ‘\advance\day by 1’; 
let \g stand for ‘\global\advance\day by 1’; and consider the following commands: 

\day=l {\a\g\a\g\a} 

The first \a sets \day=2 and remembers the old value \day=l by putting it on the 
save stack. The first \g sets \day=3, globally; nothing needs to go on the save stack 
at the time of a global assignment. The next \a sets \day=4 and remembers the 
old value \day=3 on the save stack. Then \g sets \day=5; then \a sets \day=6 and 
remembers \day=5. Finally the causes TjjX to go back through the save stack; if 
\tracingrestores=l at this point, the log file will get the following data: 

{restoring \day=5} 

{retaining \day=5} 

{retaining \day=5} 


Explanation: The \day parameter is first restored to its global value 5. Since this 
value is global, it will be retained, so the other saved values (\day=3 and \day=l) are 
essentially ignored. Moral: If you find TjgX retaining a lot of values, you have a set of 
macros that could cause the save stack to overflow in large enough jobs. To prevent 
this, it’s usually wise to be consistent in your assignments to each variable that you 
use; the assignments should either be global always or local always. 

/>)/>) TfcX provides several other kinds of tracing in addition to \tracingstats and 
JL JL \tracingrestores: We have already discussed \tracingcommands in Chapters 
13 and 20, \tracingparagraphs in Chapter 14, \tracingpages in Chapter 15, and 
\tracingmacros in Chapter 20. There is also \tracinglostchars, which (if positive) 
causes T{rX to record each time a character has been dropped because it does not 
appear in the current font; and \tracingoutput, which (if positive) causes TjgX to 
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display in symbolic form the contents of every box that is being shipped out to the 
dvi file. The latter allows you to see if things have been typeset properly, if you’re 
trying to decide whether some anomaly was caused by TgX or by some other software 
that acts on T^X’s output. 

f When T)?X displays a box as part of diagnostic output, the amount of data 
is controlled by two parameters called \showboxbreadth and \showboxdepth. 
The first of these, which plain l^X sets equal to 5, tells the maximum number of items 
shown per level; the second, which plain TJnX sets to 3, tells the deepest level. For 
example, a small box whose full contents are 


\hbox (4. 30554+1. 94444)x21 . 0 , glue set 0.5 
. \hbox (4 . 30554+1 . 94444) x5 . 0 
. . \tenrm g 
. \glue 5.0 plus 2.0 
.\tenrm I (ligature ) 


will be abbreviated as follows when \showboxbreadth=l and \showboxdepth=l: 

\hbox (4. 30554+1. 94444)x21.0, glue set 0.5 
Ahbox (4. 30554+1. 94444) x5.0 [] 

. etc . 


And if you set \showboxdepth=0, you get only the top level: 


\hbox (4. 30554+1. 94444)x21.0, glue set 0.5 [] 


(Notice how 1 [] ’ and ‘etc. ’ indicate that the data has been truncated.) 

A nonempty hbox is considered “overfull” if its glue cannot shrink to achieve 
the specified size, provided that \hbadness is less than 100 or that the excess 
width (after shrinking by the maximum amount) is more than \hfuzz. It is “tight” if 
its glue shrinks and the badness exceeds \hbadness; it is “loose” if its glue stretches 
and the badness exceeds \hbadness but is not greater than 100; it is “underfull” if 
its glue stretches and the badness is greater than \hbadness and greater than 100. 
Similar remarks apply to nonempty vboxes. T^X prints a warning message and displays 
the offending box, whenever such anomalies are discovered. Empty boxes are never 
considered to be anomalous. 



/>>/>) When an alignment is “overfull” or “tight” or “loose” or “underfull,” you don’t 
JL JL get a warning message for every aligned line; you get only one message, and 
TJnX displays a prototype row (or, with \valign, a prototype column). For example, 
suppose you say ‘\tabskip=0pt pluslOpt \halign to200pt{&#\hf il\cr . . Acr}’, and 
suppose that the aligned material turns out to make two columns of widths 50 pt and 
60 pt, respectively. Then you get the following message: 

Underfull \hbox (badness 2698) in alignment at lines 11 — 18 

□ □ 

\hbox (0 . 0+0 . 0)x200. 0, glue set 3.0 
. \glue (\tabskip) 0.0 plus 10.0 
. \unsetbox(0. 0+0. 0)x50. 0 
. \glue (\tabskip) 0.0 plus 10.0 
. \unsetbox (0 . 0+0 . 0) x60 . 0 
. \glue (\tabskip) 0.0 plus 10.0 


dvi 

sill pouf 

showboxbreadth 
sliowboxdepth 
internal box format 
symbolic box format 
ligature 
em-dash 
[] 

etc. 

overfull 

glue 

hbadness 

hfuzz 

tight 

badness 

loose 

underfull 

alignment 

prototype row 

valign 

halign 
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The “unset boxes” in a prototype row show the individual column widths. In this case 
the t.abskip glue has to stretch 3.0 times its stretchability, in order to reach the 200 pt 
goal, so the box is underfull. (According to the formula in Chapter 14, the badness 
of this situation is 2700; IjjX actually uses a similar but more efficient formula, so it 
computes a badness of 2698.) Every line of the alignment will be underfull, but only 
the prototype row will be displayed in a warning message. “Overfull rules” are never 
appended to the lines of overfull alignments. 

/>)/>) The \tracing. . . commands put all of their output into your log file, unless 
JL JL the \tracingonline parameter is positive; in the latter case, all diagnostic 
information goes to the terminal as well as to the log file. Plain IpX has a \tracingall 
macro that turns on the maximum amount of tracing of all kinds. It not only sets up 
\tracingcommands, \tracingrestores, \tracingparagraphs, and so on, it also sets 
\tracingonline=l, and it sets \showboxbreadth and \showboxdepth to extremely high 
values, so that the entire contents of all boxes will be displayed. 

/gb/gb Some production versions of TgX have been streamlined for speed. These irn- 
JL JL plementations don’t look at the values of the parameters \tracingparagraphs, 
\tracingpages, \tracingstats, and \tracingrestores, because TfrX runs faster when 
it doesn’t have to maintain statistics or keep tabs on whether tracing is required. If 
you want all of TjgX’s diagnostic tools, you should be sure to use the right version. 

/gb/gb If you set \pausing=l, IpX will give you a chance to edit each line of input 
JL JL as it is read from the file. In this way you can make temporary patches (e.g., 
you can insert \show. . . commands) while you’re troubleshooting, without changing 
the actual contents of the file, and you can keep TjjX running at human speed. 

Final hint: When working on a long manuscript, it’s best to prepare only 
a few pages at a time. Set up a “galley” file and a “book” file, and enter your 
text on the galley file. (Put control information that sets up your basic format 
at the beginning of this file; an example of galley.tex appears in Appendix E.) 
After the galleys come out looking right, you can append them to the book file; 
then you can run the book file through TgX occasionally, in order to see how the 
pages really fit together. For example, when the author prepared this manual, 
he did one chapter at a time, and the longer chapters were split into subchapters. 

(£)(£)*■ EXERCISE 27.5 

J_ JL Final exercise: Find all of the lies in this manual, and all of the jokes. 

Final exhortation: Go FORTH now and create masterpieces of the publishing art! 


Who can understand his errors? 
— Psalm 19:12 (c. 1000 B.C.) 
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It is one thing, to shew a Man that he is in an Error, 
and another, to put him in possession of Truth. 
— JOHN LOCKE, An Essay Concerning Humane Understanding (1690) 
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The preface to this manual points out the wisdom of trying to figure out each 
exercise before you look up the answer here. But these answers are intended to 
be read, since they occasionally provide additional information that you are best 
equipped to understand when you have just worked on a problem. 

1.1. A TjgXnician (underpaid); sometimes also called a TpX acker. 

2.1. Alice said, ‘‘I always use an en-dash instead of a hyphen when 
specifying page numbers like ‘480 — 491’ in a bibliography . ’ ’ (The wrong 
answer to this question ends with ’480-491’ in a bibliography.") 

2.2. You get em-dash and hyphen ( — ), which looks awful. 

2.3. fluffier firefly fisticuffs, flagstaff fireproofing, chiffchaff and riffraff. 

2.4. ‘ ‘\thinspace‘; and either or or something similar. Reason: 

There’s usually less space preceding a single left quote than there is preceding a double 
left quote. (Left and right are opposites.) 

2.5. Eliminating \thinspace would mean that a user need not learn the term; 
but it is not advisable to minimize terminology by “overloading’’ math mode with 
tricky constructions. For example, a user who wishes to take advantage of TjrjK’s 
\mathsurround feature would be thwarted by non-mathematical uses of dollar signs. 
(Incidentally, neither \thinspace nor \ , are built into TjjX; both are defined in terms 
of more primitive features, in Appendix B.) 

3.1. \I, \exercise, and \\. (The last of these is of type 2, i.e., a control symbol, 
since the second backslash is not a letter; the first backslash keeps the second one from 
starting its own control sequence.) 

3.2. math\ ’ematique and centim\ ‘etre. 

3.3. According to the index, \ u is primitive but \ (return) isn’t. The command 
‘\def\“~M{\ }’ in Appendix B is what actually defines \ (return), since a return is repre- 
sentable as **M. Asking TjjX to \show\“M produces the response ‘> \“M=macro : ->\ u . | 

3.4. There are 256 of length 2; most of these are undefined when TjjX begins. 
(TjgX allows any character to be an escape, but it does not distinguish between control 
sequences that start with different escape characters.) If we assume that there are 52 
letters, there are exactly 52 2 possible control sequences of length 3 (one for each pair 
of letters, from AA to zz). But Chapter 7 explains how to use \catcode to change any 
character into a “letter”; therefore it’s possible to use any of 256 2 potential control 
sequences of length 3. 

4.1. Ulrich Dieter, {\sl Journal f\"ur die reine und angewandte 
Mathematik\/ \bf201} (1959), 37—70. 

It’s convenient to use a single group for both \sl and \bf here. The ! \/’ is a refinement 
that you might not understand until you read the rest of Chapter 4. 

4.2. {\it Explain ... typeset a\/ {\rm roman} word ... sentence.} Note 
the position of the italic correction in this case. 

4.3. \def\ic#l{\setboxO=\hbox{#l\/}\dimenO=\wdO 
\setboxO=\hbox{#l}\advance\dimenO by -\wd0}. 


thinspace 

mathsurround 


catcode 

letter 
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4.4. Control word names are made of letters, not digits. 

4.5. Say \def\sl{\it} at the beginning, and delete other definitions of \sl that 
might be present in your format file (e.g., there might be one inside a \tenpoint macro). 

4.6. \font\squinttenrm=cmrlO at 5pt 
\font\squinttenrm=cmrlO scaled 500 

5.1. {shelf}ful or shelf-Qful, etc.; or even shelf\/ful, which yields a slielfful 
instead of a slielfful. In fact, the latter idea — to insert an italic correction — is prefer- 
able because TjrX will reinsert the ff ligature by itself after hyphenating shelf{}ful. 
(Appendix H points out that ligatures are put into a hyphenated word that contains no 
“explicit kerns,” and an italic correction is an explicit kern.) But the italic correction 
may be too much (especially in an italic font); shelf {\kernOpt}ful is often best. 

5.2. ‘u-CuV or ‘uOuOub etc. Plain TjjX also has a \space macro, so you can type 
\space\space\space. (These aren’t strictly equivalent to ‘\u\u\u\ since they adjust 
the spaces by the current “space factor,” as explained later.) 

5.3. In the first case, you get the same result as if the innermost braces had not 
appeared at all, because you haven’t used the grouping to change fonts or to control 
spacing or anything. TgX doesn’t mind if you want to waste your time making groups 
for no particular reason. But in the second case, the necessary braces were forgotten. 
You get the letter ‘S’ centered on a line by itself, followed by a paragraph that begins 
with ‘o should this.’ on the next line. 

5.4. You get the same result as if another pair of braces were present around ‘\it 
centered’, except that the period is typeset from the italic font. (Both periods look 
about the same.) The \it font will not remain in force after the \centerline, but 
this is something of a coincidence: T£X uses the braces to determine what text is to 
be centered, but then it removes the braces. The \centerline operation, as defined in 
Appendix B, puts the resulting braceless text inside another group; and that’s why \it 
disappears after \centerline. (If you don’t understand this, just don’t risk leaving 
out braces in tricky situations, and you’ll be OK.) 

5.5. \def\ital#l{{\it#l\/}}. Pro: Users might find this easier to learn, because 
it works more like \centerline and they don’t have to remember to make the italic 
correction. Con: To avoid the italic correction just before a comma or period, users 
should probably be taught another control sequence; for example, with 

\def\nocorr{\kernOpt } 

a user could type ‘ \ it al{ comma} or \ital{period\nocorr} , ’. The alternative of 
putting a period or comma in italics, to avoid the italic correction, doesn’t look as 
good. A long sequence of italics would be inefficient for TjgX, since the entire text for 
the argument to \ital must be read into memory only to be scanned again. 

5.6. {1 {2 3 4 5} 4 6} 4. 

5.7. \def\beginthe#l{\begingroup\def\blockname{#l}} 
\def\endthe#l{\def\test{#l}'/, 

\ifx\test\blockname\endgroup 
\else\errmessage{You should have said 
\string\endthe{\blockname}}\f i} 
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6.1. Laziness and/or obstinacy. 

6.2. There’s an unwanted space after ‘called — because (as the book says) TgX 
treats the end of a line as if it were a blank space. That blank space is usually what 
you want, except when a line ends with a hyphen or a dash; so you should WATCH OUT 
for lines that end with hyphens or dashes. 

6.3. It represents the heavy bar that shows up in your output. (This bar wouldn’t 
be present if \overfullrule had been set to Opt, nor is it present in an underfull box.) 

6.4. This is the \parfillskip space that ends the paragraph. In plain TgX the 
parfillskip is zero when the last line of the paragraph is full; hence no space actually 
appears before the rule in the output of Experiment 3. But all liskips show up as spaces 
in an overfull box message, even if they’re zero. 

6.5. Run T^X with \hsize=1.5in \tolerance=10000 \raggedright \hbadness=-l 
and then \input story. T)/X will report the badness of all lines (except the final lines 
of paragraphs, where fill glue makes the badness zero). 

6.6. \def\extraspace{\nobreak \hskip Opt plus . 15em\relax} 

\def\dash{\unskip\extraspace \extraspace} 

(If you try this with the story at 2-inch and 1.5-inch sizes, you will notice a substantial 
improvement. The \unskip allows people to leave a space before typing \dash. TgX 

will try to hyphenate before \dash, but not before ‘ ’; cf. Appendix H. The \relax 

at the end of \extraspace is a precaution in case the next word is ‘minus’.) 

6.7. TgX would have deleted five tokens: 1, i, n, u , \centerline. (The space was 
at the end of line 2, the \centerline at the beginning of line 3.) 

6.8. A control sequence like \centerline might well define a control sequence 
like \ERR0R before telling TgX to look at #1. Therefore TgX doesn’t interpret control 
sequences when it scans an argument. 

7.1. Three forbidden characters were used. One should type 
Procter \& Gamble’s ... \$2, a 10V/, gain. 

(Also the facts are wrong.) 

7.2. Reverse slashes (backslashes) are fairly uncommon in formulas or text, and \\ 
is very easy to type; it was therefore felt best not to reserve \\ for such limited use. 
Typists can define \\ to be whatever they want (including \backslash). 

7.3. 1, 2, 3, 4, 6, 7, 8, 10, 11, 12, 13. Active characters (type 13) are somewhat 
special; they behave like control sequences in most cases (e.g., when you say ‘\let\x=~’ 
or ‘\ifx\x~’), but they behave like character tokens when they appear in the token list 
of \uppercase or \lowercase, and when unexpanded after \if or \ifcat. 

7.4. It ends with either > or } or any character of category 2; then the effects of all 
\catcode definitions within the group are wiped out, except those that were \global. 
T)rX doesn’t have any built-in knowledge about how to pair up particular kinds of 
grouping characters. New category codes take effect as soon as a \catcode assignment 
has been digested. For example, 
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is a complete group. But without the space after ‘2’ it would not be complete, since T|'X 
would have read the “>’ and converted it to a token before knowing what category code 
was being specified; TjgX always reads the token following a constant before evaluating 
that constant. 

7.5. If you type l \message{\string~}’ and l \message{\string\~}’, Tj?X responds 
with and ‘\~\ respectively. To get \i 2 from \string you therefore need to make 
backslash an active character. One way to do this is 

{\catcode ‘/=0 \catcode ‘\\=13 /message{/string\}} 

(The “null control sequence” that you get when there are no tokens between \csname 
and \endcsname is not a solution to this exercise, because \string converts it to 
‘\csname\endcsname’. There is, however, another solution: If TjgX’s \escapechar 
parameter — which will be explained in one of the next dangerous bends — is negative 
or greater than 255, then ‘\string\\’ works.) 

7.6. \ 12 a .12 \ 12 uio hi 2 - 

7.7. \def\ifundef ined#l{\expandafter\ifx\csname#l\endcsname\relax} 

Note that a control sequence like this must be used with care; it cannot be included in 
conditional text, because the \ifx will not be seen when \ifundefined isn’t expanded. 

7.8. First \uppercase produces ‘A\lowercase{BC}’; then you get ‘Abe’. 

7.9. ‘ \copyright\ \uppercase\expandafter{\romannumeral\year} ’. (This is 
admittedly tricky; the ‘\expandafter’ expands the token after the ‘{’, not the token 
after the group.) 

7.10. (We assume that parameter #2 is not simply an active character, and that 
\escapechar is between 0 and 255.) 

\def\gobble#l{} '/, remove one token 
\def\appendroman#l#2#3{\edef#l{\csname 

\expandafter\gobble\string#2\romannumeral#3\endcsname}} 

8.1. The ’/, would be treated as a comment character, because its category code 
is 14; thus, no '/, token or } token would get through to the gullet of IjgX where numbers 
are treated. When a character is of category 0, 5, 9, 14, or 15, the extra \ must be 
used; and the \ doesn’t hurt, so you can always use it to be safe. 

8.2. (a) Both characters terminate the current line; but a character of category 5 
might be converted into u io or a |par | token, while a character of category 14 never 
produces a token, (b) They produce character tokens stamped with different category 
numbers. For example, $3 is not the same token as $ 4 , so TjgX’s digestive processes 
will treat them differently, (c) Same as (b), plus the fact that control sequence names 
treat letters differently, (d) No. (e) Yes; characters of category 10 are ignored at the 
beginning of every line, since every line starts in state N. (f) No. 

8.3. TpX had just read the control sequence \vship, so it was in state S, and it 
was just ready to read the space before Tin’. Afterwards it ignored that space, since it 
was in state S; but if you had typed I\obeyspaces in response to that error message, 
you would have seen the space. Incidentally, when TgX prints the context of an error 
message, the bottom pair of lines conies from a text file, but the other pairs of lines 


constant 

message 

null control sequence 

escapechar 

conditional 

expandafter 

escapechar 

context of an error message 



Appendix A: Answers to All the Exercises 


are portions of token lists that T)gX is reading (unless they begin with “<*>’, when they 
represent text inserted during error recovery). 

8.4. $3 xn "7 2i2 $3 ~i3 uio |TeX| bi 2 vi 2 u io- The final space conies from the 
(return) placed at the end of the line. Code “6 yields v only when not followed by 0-9 
or a-f . The initial space is ignored, because state N governs the beginning of the line. 

8.5. Hu in ! i2 uio | par | [par]. The ‘ u ’ comes from the (return) at the end of the 
first line; the second and third lines each contribute a |par | . 

8 .6. The two “B’s are not recognized as consecutive superscript characters, since 
the first ~~B is converted to code 2 which doesn’t equal the following character % Hence 
the result is seven tokens: “B7 ""B7 Mu | B | u i 0 ‘"Mu |M~~M| . The last of these is a 
control word whose name has two letters. The (space) after \M is deleted before Tp'X 
inserts the (return) token. 

8.7. Both alternatives work fine in text; in particular, they combine as in \lq\lq 
to form ligatures. But the definition in Appendix B works also in connection with 
constants; e.g., \char\lq\'/, and \char\rql40 are valid. (Incidentally, the construction 
\let\lq=‘ would not work with constants, since the quotes in a (number) must come 
from character tokens of category 12; after \let\lq=‘ the control sequence token \lq 
will not expand into a character token, nor is it a character token!) 

9.1. na\"\i ve or na{\"\i}ve or na\"{\i}ve. 

9 . 2 . Beloved protege; role coordinator; souffles, crepes, pates, etc. 

9.3. \AE sop’s \0E uvres en fran\c cais. 

9.4. {\sl Commentarii Academi\ae\ scientiarum imperialis 
petropolitan\ae\/} is now {\sl Akademi\t\i a Mauk SSSR, Doklady}. 

9.5. Ernesto CesVaro, P\’al Erd\H os, \0 ystein Ore, Stanis\l aw X’Swier*/, 
czkowski , Serge\u\i\ \t Iur’ev, Mu\d haramad ibn M\~us\~a al-Khw\‘arizm\‘\i . 

9.6. The proper umlaut is \H, which isn’t available in \tt, so it’s necessary to 
borrow the accent from another font. For example, {\tt P\’al Erd{\bf\H{\tt o}}s} 
uses a bold accent, which is suitably dark. 

9.7. {\it Europe on {\sl\$}15.00 a day\/} 

9.8. The extra braces keep font changes local. An argument makes the use of V 
more consistent with the use of other accents like \d, which are manufactured from 
other characters without using the \accent primitive. 

10 . 1 . Exactly 7227 pt. 

10 . 2 . —.013837 in, 0. mm, +42.1 dd, 3 in, 29 pc, 123456789 sp. (The lines of text in 
this manual are 29 picas wide.) 

10 . 3 . The first is not allowed, since octal notation cannot be used with a decimal 
point. The second is, however, legal, since a (number) can be hexadecimal according 
to the rule mentioned in Chapter 8; it means 12 cc, which is 144 dd 154.08124 pt. 
The third is also accepted, since a (digit string) can be empty; it is a complicated way 
to say 0 sp. 
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10.4. \def\tick#l{\vrule height Opt depth #lpt} 

\def\\{\hbox to lcm{\hf il\tick4\hf il\tick8}} 
\vbox{\hrule\hbox{\tick8\\\\\\\\\\\\\\\\\\\\}} 

(You might also try putting ticks at every millimeter, in order to see how good your 
system is; some output devices can’t handle 101 rules all at once.) 

10.5. For example, say ‘\magnif ication=\magstepl \input story \end’ to get 
magnification 1200; \magstep2 and \magstep3 are 1440 and 1728. Three separate runs 
are needed, since there can be at most one magnification per job. The output may look 
funny if the fonts don’t exist at the stated magnifications. 

10.6. Magnification is by a factor of 1.2. Since font \first is cmrlO at 12 pt, it will 
be cmrlO at 14.4 pt after magnification; font \second will be cmrlO at 12 pt. (Tj?X 
changes Y2truept’ into TOpt’, and the final output magnifies it back to 12 pt.) 

11.1. This E is inside a box that’s inside a box. 

11.2. The idea is to construct a box and to look inside. For example, 
\setbox0=\hbox{\sl g\/} XshowboxO 

reveals that V is implemented by placing a kern after the character. Further experiment 
shows that this kern is inserted even when the italic correction is zero. 

11.3. The height, depth, and width of the enclosing box should be just large enough 
to enclose all of the contents, so the result is: 

\hbox (8 . 98608+0 . 0)x24 . 44484 
. \tenrm T 
Akern 1.66702 

Ahbox (6. 83331+0. 0)x6. 80557, shifted -2.15277 
. . \tenrm E 
Akern 1.25 
. \tenrm X 

(You probably predicted a height of 8.9861; IjrX’s internal calculations are in sp, not 
pt/100000, so the rounding in the fifth decimal place is not readily predictable.) 

11.4. No applications of such symmetrical boxes to English-language printing were 
apparent; it seemed pointless to carry extra generality as useless baggage that would 
rarely if ever be used, merely for the sake of symmetry. In other words, the author 
wore a computer science cap instead of a mathematician’s mantle on the day that TjrjX’s 
boxes were born. Time will tell whether or not this was a fundamental error! 

11.5. The following solution is based on a general \makeblankbox macro that prints 
the edges of a box using rules of given thickness outside and inside that box; the box 
dimensions are those of \box0. 

\def\dolist{\afterassignment\dodolist\let\next= } 
\def\dodolist{\ifx\next\endlist \let\next\relax 
\else \\\let\next\dolist \fi 
\next} 

\def\endlist{\endlist} 
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\def\hidehrule#l#2{\kern-#l'/, 

\hrule heightil depth#2 \kern-#2 } 
\def\hidevrule#l#2{\kern-#l{\dimen0=#l 

\advance\dimenO by#2\vrule width\dimen0}\kern-#2 } 
\def\makeblankbox#l#2{\hbox{\lower\dp0\vbox{\hidehrule{#l}{#2}'/, 
\kern-#l */, overlap the rules at the corners 
\hbox to \wd0{\hidevrule{#l}{#2}*/, 

\raise\htO\vbox to #1 {}'/, set the vrule height 
\lower\dpO\vtop to #1{}'/, set the vrule depth 
\hf il\hidevrule{#2}{#l}}'/, 

\kern-#l\hidehrule{#2}{#l}}}} 

\def\maketypebox{\makeblankbox{Opt}{lpt}} 

\def \makelightbox{\makeblankbox{ . 2pt}{ . 2pt}} 
\def\\{\expandafter\if\space\next\ 

\else \setboxO=\hbox{\next}\maketypebox\f i} 
\def\demobox#l{\setboxO=\hbox{\dolist#l\endlist}*/, 
\copyO\kern-\wdO\makelightbox} 

11.6. \def\frac#l/#2{\leavevmode\kern. lem 

\raise . 5ex\hbox{\the\scriptf onto #l}\kern- . lem 
/\kern- . 15em\lower . 25ex\hbox{\the\scriptfont0 #2}} 

12.1. 9 + 16 units, 9 + 32 units, 12 + 0 units. (But T)?X would consider so much 
stretching to be “infinitely bad.” ) 

12.2. ‘What happens now?’ is placed in a line of width \hsize, with twice as much 
space at the left as at the right; ‘and now?’ is put flush right on the following line. 

12.3. The first two give an “overfull box” if the argument doesn’t fit on a line; 
the third allows the argument to stick out into the margins instead. (Plain l^X’s 
\centerline is \centerlinec; the stickout effect shows up in the narrow-column ex- 
periment of Chapter 6.) If the argument contains no infinite glue, \centerlinea and 
\centerlineb produce the same effect; but \centerlineb will center an argument that 
contains ‘fiT glue. 

12.4. Mr.~\& Mrs. "User were married by Eev. "Drofnats, who preached on 
Matt . "19\thinspace : \thinspace3 — 9. (Such thin spaces are traditional for Biblical 
references to chapter and verse, but you weren’t really expected to know that. Plain 
T^X defines \thinspace to be a kern, not glue; hence no break between lines will occur 
at a thinspace.) 

12.5. Donald"E.\ Knuth, ‘‘Mathematical typography ’ {\sl Bull.\ Amer.\ 
Math.\ Soc.\ \bfl} (1979), 337 — 372. (But the ‘V after ‘E . ’ isn’t necessary, 
because of a rule you will learn if you venture around the next dangerous bend.) 

12.6. There are several ways; perhaps the easiest are to type ‘\hbox{NASA> . ’ or 
‘NASA\null.’ (The \null macro is an abbreviation for ‘\hbox{}’.) 

12.7. 1000, except: 999 after B, S, D, and J; 1250 after the comma; 3000 after the 
exclamation point, the double-right-quote, and the periods. If a period had come right 
after the B (i.e., if the text had said ‘B. Sally’), the space factor after that period 
would have been 1000, not 3000. 


centerline 
Biblical references 
thinspace 
null 
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12.8. \box3 is 2pt high, 4pt deep, 3pt wide. Starting at the reference point of 
\box3, go right .75 pt and down 3pt to reach the reference point of \boxl; or go right 
lpt to reach the reference point of \box2. 

12.9. The stretch and shrink components of \baselineskip and \lineskip should 
be equal, and the \lineskiplimit should equal the normal \lineskip spacing, to 
guarantee continuity. 

12.10. Yes it did, but only because none of his boxes had a negative height or depth. 
He would have been safer if he had set \baselineskip=-1000pt, \lineskip=0pt, and 
\lineskiplimit=16383pt. (Plain TjgX’s \off interlineskip macro does this.) 

12.11. The interline glue will be zero, and the natural height is 1 + 1 — 3 + 2 = 1 pt 
(because the depth of \box2 isn’t included in the natural height); so the glue will 
ultimately become \vskip-lpt when it’s set. Thus, \box3 is 3pt high, 2pt deep, 4pt 
wide. Its reference point coincides with that of \box2; to get to the reference point of 
\boxl you go up 2pt and right 3pt. 

12.12. The interline glue will be 6pt minus 3 fil; the final depth will be zero, since 
\box2 is followed by glue; the natural height is 12 pt; and the shrinkability is 5 fil. So 
\box4 will be 4pt high, Opt deep, lpt wide, and it will contain five items: \vskip 
-1.6pt, \boxl, \vskipl.2pt, \moveleft4pt\box2, \vskip-l . 6pt. Starting at the ref- 
erence point of \box4, you get to the reference point of \boxl by going up 4.6 pt, or 
to the reference point of \box2 by going up .4 pt and left 4pt. (For example, you go 
up 4pt to get to the upper left corner of \box4; then down — 1.6 pt, i.e., up 1.6 pt, to 
get to the upper left corner of \boxl; then down lpt to reach its reference point. This 
problem is clearly academic, since it’s rather ridiculous to include infinite shrinkability 
in the baselineskip.) 

12.13. Now \box4 will be 4pt high, — 4pt deep, 1 pt wide, and it will contain \vskip 
-2.4pt, \boxl, \vskip-l . 2pt, \moveleft4pt\box2, \vskip-2 .4pt. From the baseline 
of \box4, go up exactly 5.4 pt to reach the baseline of \boxl, or exactly 3.6 pt to reach 
the baseline of \box2. 

12.14. \vbox to x{} produces height x; \vtop to x{} produces depth x\ the other 
dimensions are zero. (This holds even when x is negative.) 

12.15. There are several possibilities: 

\def\nullbox#l#2#3{\vbox to#l{\vss\hrule height-#2depth#2width#3}} 
works because the rule will be of zero thickness. Less tricky is 

\def\nullbox#l#2#3{\vbox to#l{\vss\vtop to#2{\vss\hbox to#3{}}}} 

Both of these are valid with negative height and/or depth, but they do not produce 
negative width. If the width might be negative, but not the height or depth, you can 
use, e.g., \def\nullbox#l#2#3{\hbox to#3{\hss\raise#l\null\lower#2\null}}. It’s 
impossible for \hbox to construct a box whose height or depth is negative; it’s impossible 
for \vbox or \vtop to construct a box whose width is negative. 

However, there’s actually a trivial solution to the general problem, based on 
features that will be discussed later: 

\def\nullbox#l#2#3{\setbox0=\null 
\ht0=#l \dp0=#2 \wd0=#3 \boxO } 


offinterlineskip 
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12.16. \def\llap#l{\hbox to Opt{\hss#l}} 

12.17. You get ‘A’ at the extreme left and ‘puzzle.’ at the extreme right, because the 
space between words has the only stretchability that is finite; the infinite stretchability 
cancels out. (In this case, TjrjX’s rule about infinite glue differs from what you would 
get in the limit if the value of 1 fil were finite but getting larger and larger. The true 
limiting behavior would stretch the text ‘A puzzle.’ in the same way, but it would also 
move that text infinitely far away beyond the right edge of the page.) 

13.1. Simply saying \hbox{. . .} won’t work, since that box will just continue the 
previous vertical list without switching modes. You need to start the paragraph explic- 
itly, and the straightforward way to do that is to say \indent\hbox{ . . . }. But suppose 
you want to define a macro that expands to an hbox, where this macro is to be used in 
the midst of a paragraph as well as at the beginning; then you don’t want, to force users 
to type \indent before calling your macro at the beginning of a paragraph, nor do you 
want to say \indent in the macro itself (since that might insert unwanted indenta- 
tions). One solution to this more general problem is to say ‘\ u \unskip\hbox{ . . .}’, 
since \ u makes the mode horizontal while \unskip removes the unwanted space. Plain 
IjgX provides a \leavevmode macro, which solves this problem in what is probably 
the most efficient way: \leavevmode is an abbreviation for ‘\unhbox\voidbox’, where 
\voidbox is a permanently empty box register. 

13.2. The output of \tracingcommands shows that four blank space tokens were 
digested; these originated at the ends of lines 2, 3, 4, and 5. Only the first had any 
effect, since blank spaces are ignored in math formulas and in vertical modes. 

13.3. The end-group character finishes the paragraph and the \vbox, and \bye 
stands for ‘\vfill. . .’, so the next three commands are 

{math mode : math shift character $} 

{restricted horizontal mode: end-group character }} 

{vertical mode: \vfill} 

13.4. It contains only mixtures of vertical glue and horizontal rules whose reference 
points appear at the left of the page; there’s no text. 

13.5. Vertical mode can occur only as the outermost mode; horizontal mode and 
display math mode can occur only when immediately enclosed by vertical or internal 
vertical mode; ordinary math mode cannot be immediately enclosed by vertical or 
internal vertical mode; all other cases are possible. 

14.1. (cf . "Chapter" 12) . 

Chapters 12 and"21. 

line" 16 of Chapter"6’s {\tt story} 

lines 7 to"ll 

lines 2, "3, 4, and"5. 

(2) "a big black bar 

All 256"characters are initially of category" 12, 
letter"{\tt x} in family"l. 

the factor"$f$, where $n$~is 1000"times"$f$ . 
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14.2. ‘for all $n$"greater than"$n_0$’ avoids distracting breaks. 
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14.3. 1 exercise \hbox{4.3.2 — 15}’ guarantees that there is no break after the en- 
dasli. But this precaution is rarely necessary, so ‘exercise 4.3.2 — 15’ is an acceptable 
answer. No ~ is needed; ‘4.3.2-15’ is so long that it causes no offense at the beginning 
of a line. 


14.4. The space you get from " will stretch or shrink with the other spaces in the 
same line, but the space inside an hbox has a fixed width since that glue has already 
been set once and for all. Furthermore the first alternative permits the word Chap- 
ter to be hyphenated. 

14.5. ‘\hbox{$x=0$}’ is unbreakable, and we will see later that ‘${x=0}$’ cannot be 
broken. Both of these solutions set the glue surrounding the equals sign to some fixed 
value, but such glue normally wants to stretch; furthermore, the \hbox solution might 
include undesirable blank space at the beginning or end of a line, if \mathsurround is 
nonzero. A third solution ‘$x=\nobreakO$’ avoids both defects. 

14.6. \exhyphenpenalty=10000 prohibits all such breaks, according to the rules 
found later in this chapter. Similarly, \hyphenpenalty=10000 prevents breaks after 
implicit (discretionary) hyphens. 

14.7. The second and fourth lines are indented by an additional “quad” of space, 
i.e., by one extra em in the current type style. (The control sequence \quad does an 
\hskip; when Tp)X is in vertical mode, \hskip begins a new paragraph and puts glue 
after the indentation.) If \indent had been used instead, those lines wouldn’t have 
been indented any more than the first and third, because \ indent is implicit at the 
beginning of every paragraph. Double indentation on the second and fourth lines could 
have been achieved by ‘\indent\indent’. 

14.8. ba\ck en and Be\ttt uch, where the macros \ck and \ttt are defined by 

\def\ck{\discretionary{k-}{k}{ck}} 

\def \ttt{tt \discret ionary {-}{t}{}} 

The English word ‘eighteen’ might deserve similar treatment. T)?X’s hyphenation al- 
gorithm will not make such spelling changes automatically. 

14.9. \def\break{\penalty-10000 } 

14.10. You get a forced break as if \nobreak were not present, because \break cannot 
be cancelled by another penalty. In general if you have two penalties in a row, their 
combined effect is the same as a single penalty whose value is the minimum of the two 
original values, unless both of those values force breaks. (You get two breaks from 
\break\break; the second one creates an empty line.) 

14.11. Breaks are forced when p < —10000, so there’s no point in subtracting a large 
constant whose effect on the total demerits is known a priori, especially when that 
might cause arithmetic overflow. 

14.12. (10 + 131 ) 2 + 0 2 + 10000 = 29881 and (10 + l) 2 + 50 2 + 10000 = 12621. In 
both cases the \adj demerits were added because the lines were visually incompati- 
ble (decent, then very loose, then decent); plain l}rX’s values for \linepenalty and 
\adj demerits were used. 


en-dash 

hyphenate 

mathsurround 

hskip 

adjdemerits 

linepenalty 



Appendix A: Answers to All the Exercises 315 


14.13. Because TjgX discards a glue item that occurs just before \par. Ben should 
have said, e.g., ‘\hfilneg\ \par’. 

14.14. Just say \parf illskip=\parindent. Of course, TjrjX will not be able to find 
appropriate line breaks unless each paragraph is sufficiently long or sufficiently lucky; 
but with an appropriate text, your output will be immaculately symmetrical. 

14.15. Assuming that the author is deceased and/or set in his or her ways, the remedy 
is to insert ‘{\parf illskip=Opt\par\parskip=Opt\noindent}’ in random places, after 
each 50 lines or so of text. (Every space between words is usually a feasible breakpoint, 
when you get sufficiently far from the beginning of a paragraph.) 

14.16. {\leftskip=-lpt \rightskip=lpt (text) \par} 

(This applies to a full paragraph; if you want to correct only isolated lines, you have 
to do it by hand.) 

14.17. ‘\def \line#l{\hbox to\hsize{\hskip\leftskip#l\hskip\rightskip}}’ is 
the only change needed. (Incidentally, displayed equations don’t take account of 
\leftskip and \rightskip either; it’s more difficult to change that, because so many 
variations are possible.) 

14.18. The author’s best solution is based on a variable \dimen register \x: 
\setboxl=\hbox{I} 

\setboxO=\vbox{\parshape=ll -0\x0\x -l\x2\x -2\x4\x -3\x6\x 

-4\x8\x -5\xl0\x -6\xl2\x -7\xl4\x -8\xl6\x -9\xl8\x -10\x20\x 
\ifdim \x>2em \rightskip=-\wdl 

\else \frenchspacing \rightskip=-\wdl pluslpt minuslpt 
\leftskip=Opt plus lpt minuslpt \fi 
\parf illskip=Opt \tolerance=1000 \noindent I turn, ... hand.} 
\centerline{\hbox to \wdl{\boxO\hss}} 

Satisfactory results are obtained with font cmrlO when \x is set to 8.9 pt, 13.4 pt, 
18.1 pt, 22.6 pt, 32.6 pt, and 47.2 pt, yielding triangles that are respectively 11, 9, 8, 7, 
6, and 5 lines tall. 

14.19. \item{} at the beginning of each paragraph that wants hanging indentation. 

14.20. \item{$\bullet$} 

14.21. Either change \hsize or \rightskip. The trick is to change it back again at 
the end of a paragraph. Here’s one way, without grouping: 

\let\endgraf=\par \edef \restorehsize{\hsize=\the\hsize} 
\def\par{\endgraf \restorehsize \let\par=\endgraf} 

\advance\hsize by-\parindent 

14.22. \dimenO=\hsize \advance\dimenO by 2em 
\parshape=3 0pt\hsize 0pt\hsize -2em\dimen0 

14.23. The three paragraphs can be combined into a single paragraph, if you use 
‘\hf il\vadjust{\vskip\parskip}\break\indent’ instead of ‘\par’ after the first two. 
Then of course you say, e.g., \hangindent=-50pt \hangafter=-15. (The same idea 
can be applied in connection with \looseness, if you want T}rX to make one of three 


displayed equations 
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paragraphs looser but. if you don’t want to choose which one it will be. However, long 
paragraphs fill T^X’s memory; please use restraint.) See also the next exercise. 

14.24. Use \hangcarryover between paragraphs, defined as follows: 

\def\hangcarryover{\edef\next{\hangafter=\the\hangafter 

\hangindent=\the\hangindent} 

\par\next 

\edef \next{\prevgraf=\the\prevgraf} 

\ indent \next} 

14.25. It will set the current paragraph in the minimum number of lines that can be 
achieved without violating the tolerance; and, given that number of lines, it will break 
them optimally. (However, nonzero looseness makes T£X work harder, so this is not 
recommended if you don’t want to pay for the extra computation. You can achieve 
almost the same result much more efficiently by setting \linepenalty=100, say.) 

14.26. 150, 100, 0, 250. (When the total penalty is zero, as between lines 3 and 4 in 
this case, no penalty is actually inserted.) 

14.27. \interlinepenalty plus \clubpenalty plus \widowpenalty (and also plus 
\brokenpenalty, if the first line ends with a discretionary break). 

14.28. The tricky part is to avoid “opening up” the paragraph by adding anything 
to its height; yet this star is to be contributed after a line having an unknown depth, 
because the depth of the line depends on details of line breaking that aren’t known until 
afterwards. The following solution uses \strut, and assumes that the line containing 
the marginal star does not have depth exceeding \dp\strutbox, the depth of a \strut. 

\def\strutdepth{\dp\strutbox} 

\def\marginalstar{\strut\vadjust{\kern-\strutdepth\specialstar}} 

Here \specialstar is a box of height zero and depth \strutdepth, and it puts an 
asterisk in the left margin: 

\def\specialstar{\vtop to \strutdepth{ 

\baselineskip\strutdepth 
\vss\llap{* }\null}} 

14.29. \def\insertbullets{\everypar={\llap{$\bullet$\enspace}}} 

(A similar device can be used to insert hanging indentation, and/or to number the 
paragraphs automatically. ) 

14.30. First comes \parskip glue (but you might not see it on the current page if you 
say \showlists, since glue disappears at the top of each page). Then comes the result 
of \everypar, but let’s assume that \everypar doesn’t add anything to the horizontal 
list, so that you get an empty horizontal list; then there’s no partial paragraph before 
the display. The displayed equation follows the normal rules (it occupies lines 1-3 of 
the paragraph, and uses the indentation and length of line 2, if there’s a nonstandard 
shape). Nothing follows the display, since a blank space is ignored after a closing “$$’. 

Incidentally, the behavior is different if you start a paragraph with ‘$$’ instead 
of with \noindent$$, since IgX inserts a paragraph indentation that will appear on a 
line by itself (with \leftskip and \parfillskip and \rightskip glue). 


linepenalty 

strut 

dp 

strut 

display at beginning of paragrapl 



Appendix A: Answers to All the Exercises 317 


14.31. A break at \penalty50 would cancel \hskip2em\nobreak\hf il, so the next 
line would be forced to start with the reviewer’s name flush left. (But \vadjust{} 
would actually be better than \hbox{}; it uses TjjX more efficiently.) 

14.32. Otherwise the line-breaking algorithm might prefer two final lines to one final 
line, simply in order to move a hyphen from the second-last line up to the third-last line 
where it doesn’t cause demerits. This in fact caused some surprises when the \signed 
macro was being tested; \tracingparagraphs=l was used to diagnose the problem. 

14.33. Distributing the extra space evenly would lead to three lines of the maximum 
badness (10000). It’s better to have just one bad line instead of three, since TjrX doesn’t 
distinguish degrees of badness when lines are really awful. In this particular case the 
\tolerance was 200, so IjvX didn’t try any line breaks that would stretch the first two 
lines; but even if the tolerance had been raised to 10000, the optimum setting would 
have had only one underfull line. If you really want to spread the space evenly you can 
do so by using \spaceskip to increase the amount of stretchability between words. 

14.34. \def\raggedcenter{\leftskip=Opt plus4em \rightskip=\leftskip 
\parf illskip=Opt \spaceskip=. 3333em \xspaceskip=. 5em 
\pretolerance=9999 \tolerance=9999 \parindent=Opt 
\hyphenpenalty=9999 \exhyphenpenalty=9999 } 

15.1. The last three page-break calculations would have been 

'/, t=503.0 plus 8.0 minus 4.0 g=528.0 b=3049 p=150 c=3199# 

'/, t=514.0 plus 8.0 minus 4.0 g=528.0 b=533 p=-100 c=433# 

/ t=542.0 plus 11.0 minus 6.0 g=528.0 b=* p=0 c=* 

so the break would have occurred at the same place. The badness would have been 533, 
but the page would still have looked tolerable. (On the other hand if that paragraph 
had been two lines shorter instead of one, the first two lines of the next “dangerous 
bend” paragraph would have appeared on that page; the natural height t = 531 pt 
would have been able to shrink to g = 528 pt because the three “medskips” on the page 
would have had a total shrinkability of 6 pt. This would certainly have been preferable 
to a stretched-out page whose badness was 3049; but the author might have seen it and 
written another sentence or two, so that the paragraph would not have been broken 
up. After all, this manual is supposed to be an example of good practice.) 

15.2. The next legal break after the beginning of a dangerous bend paragraph occurs 
28 pt later, because there is 6 pt additional space for a \medskip, followed by two lines 
of 11 pt each. TjrX does not allow breaking between those two lines; the \clubpenalty 
is set briefly to 10000 in Appendix E, since the dangerous bend symbol is two lines tall. 

15.3. A page always contains at least one box, if there are no insertions, since the 
legal breakpoints are discarded otherwise. Statement (a) fails if the height of the 
topmost box exceeds 10 pt. Statement (b) fails if the depth of the bottommost box 
exceeds 2.2 pt, or if some glue or kern comes between the bottommost box and the 
page break (unless that glue or kern exactly cancels the depth of the box). 

15.4. \topinsert\vskip2in\rightline{\vbox{\hsize ... artwork. }}\endinsert 
does the job. But it’s slightly more efficient to avoid \rightline by changing \leftskip 
as follows: ‘\leftskip=\hsize \advance\leftskip by-3in’. Then TgX doesn’t have 
to read the text of the caption twice. 
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15.5. It would appear on page 25, since it does fit there. A \midinsert will jump 
ahead of other insertions only if it is not carried over to another page; for example, 
if the second 3-inch insertion were a \midinsert, it would not appear on page 26, 
because it is converted to a \topinsert as soon as the \midinsert macro notices that 
the insertion is too big for page 25. 

15.6. Set \countl to 50, then \dimen2 to 50 pt, then \countl to 6, then \skip2 
to — 10 pt plus 6fil minus 50 pt, then \skip2 to 60 pt plus — 36fil minus — 300 pt, then 
\skip2 to lsp minus — 6sp, then \count6 to 1, then \skipl to 25 pt plus lsp mi- 
nus 1 fill, then \skip2 to 25 pt minus —150 pt, then \skipl to 0 pt plus 1 sp minus 1 fill. 

15.7. If \skip4 has infinite stretcliability, \skip5 will be zero; otherwise it will be 
Opt plus lpt. 

15.8. \advance\dimen2 by0.5\dimen3 \divide\dimen2 by\dimen3 
\multiply\dimen2 by\dimen3 

15.9. \countl takes the values 5, then 2 (the old 5 is saved), then 4 (which is made 
global), then 8 (and 4 is saved); finally the value 4 is restored, and that is the answer. 
(For further remarks, see the discussion of \tracingrestores in Chapter 27.) 

15.10. \hbox{\hbox{A}A}. After ‘\unhbox5’, \box5 is void; \unhcopy5 yields nothing. 

15.11. \hbox{A}. But after ‘{\global\setbox3=\hbox{A}\setbox3=\box3}’, \box3 
will be void. 

15.12. \newcount\notenumber 
\def\clearnotenumber{\notenumber=0\relax} 
\def\note{\advance\notenumber by 1 

\footnote{$~{\the\notenumber}$}} 

15.13. Yes, in severe circumstances. (1) If there is no other legal breakpoint, TjgX 
will take a break whose cost is oo. (2) If \vadjust{\eject} occurs on the same line 
as a footnote, before that footnote, the reference will be forcibly detached. (3) Other 
\vadjust commands on that line could also interpose breakpoints before the insertion. 

16.1. $\gairana+\nu\in\Gamma$. 

16.2. \le, \ge, and \ne. (These are short for “less-or-equal,” “greater-or-equal,” 
and “not-equal.” ) You can also use the names \leq, \geq, and \neq. (The fourth most 
common symbol is, perhaps, ‘oo’, which stands for “infinity” and is called ‘\infty’.) 

16.3. In the former, the ‘_2’ applies to the plus sign (a; + 2 F 3 ),; but in the latter, it 
applies to an empty subformula (x + 2 F 3 ). 

16.4. The results are ‘x v ~' and ‘x y ’; the z in the first alternative is the same size as 
the y, but in the second it is smaller. Furthermore, the y and a in the first case aren’t 
quite at the same height. (Good typists never even think of the first construction, 
because mathematicians never want it.) 

16.5. The second alternative doesn’t work properly when there’s a subscript at the 
same time as a prime. Furthermore, some mathematicians use \prime also in the 
subscript position; they write, for example, F'(w,z) = dF{w,z)/dz and F,(w,z) = 
dF(w, z)/dw. 
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16.6. $R_i{}~{jk}{}_l$. 

16.7. 10“{10}; 2~{n+l}; (n+l)“2; \sqrt{l-x"2}; \overline{w+\overline z}; 

p_l~{e_l}; a_{b_{c_{d_e}}}; \root3\of{h’ ’_n(\alpha x)}. (Of course, you should 
enclose these formulas in dollar signs so that TpX will process them in math mode. 
Superscripts and subscripts can be given in either order; for example, h ’ ’ _n and h_n ’ ’ 
both work the same. You should not leave out any of the braces shown here; for 
example, ‘$10"10$’ would yield ‘ 10 1 O’. But it doesn’t hurt to insert additional braces 
around letters or numbers, as in 1 ({n}+{l}) ~{2}’. The indicated blank spaces are 
necessary unless you use extra braces; otherwise TpX will complain about undefined 
control sequences \overlinez and \alphax.) 

16.8. He got Tfa: = y . . .’ because he forgot to leave a space after ‘If’; spaces dis- 
appear between dollar signs. He should also have ended the sentence with ‘$y$.’; 
punctuation that belongs to a sentence should not be included in a formula, as we will 
see in Chapter 18. (But you aren’t expected to know that yet.) 

16.9. Deleting an element from an $n$-tuple leaves an $ (n-1) $-tuple . 

16.10. Q, /, g, j, p, q, y. (The analogous Greek letters are 3, 7, (, 7, /(, £, p, 4 >, cp, x, ip-) 

16.11. $z"{*2}$ and $h_* ’ (z) $. 

16.12. $3{\cdot}1416$. (One of the earlier examples in this chapter showed that 
\cdot is a binary operation; putting it in braces makes it act like an ordinary symbol.) 

If you have lots of constants like this, for example in a table, there’s a way to 
make ordinary periods act like \cdot symbols: Just define \mathcode‘ . to be "0201, 
assuming that the fonts of plain Tj^X are being used. However, this could be dangerous, 
since ordinary periods are used frequently in displayed equations; the \mathcode change 
should be confined to places where every period is to be a \cdot. 

16.13. $e“{-x~2}$, $D\sim p~\alpha M+l$, and $\ghat\in(H~{\pi_l~{-l}}) ’ $. (If 
you are reading the dangerous bend sections, you know that the recommended way to 
define \ghat is ‘\def \ghat{{\hat g}}’.) 

17.1. x + y 2/(k+1) ($x+y-{2/(k+l)}$). 

17.2. ((a + l)/(6 + 1))® ($((a+l)/(b+l))x$). 

17.3. He got the displayed formula 

a- = (y 2 
k + 1) 

because he forgot that an unconfined \over applies to everything. (He should probably 
have typed ‘$$x=\left(y~2\over k+l\right) $$’, using ideas that will be presented 
later in this chapter; this not only makes the parentheses larger, it keeps the ‘ x =’ out 
of the fraction, because \left and \right introduce subformulas.) 

17.4. ‘$7{l\over2}\cents$’ or ‘7$l\over2$\cents’. (Incidentally, the definition 
used here was \def\cents{\hbox{\rm\rlap/c}}.) 

17.5. Style D' is used for the subformula p\ , hence style S' is used for the super- 
script e and the subscript 2, and style SS' is used for the supersuperscript prime. The 
square root sign and the p appear in text size; the 2 and the e appear in script size; 
and the / is in scriptscript size. 
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17.6. $${l\over2}{n\choose k}$$; $$\displaystyle{n\ choose k}\over2$$. All 

of these braces are necessary. 

17.7. $${p \choose 2} x"2 y"{p-2} - {1 \over 1 - x }{1 \over l-x“2}.$$ 

17.8. $$\sum_{i=l}~p\sum_{j=l}"q\sum_{k=l}“ra_{i j}b_{ jk}c_{ki}$$. 

17.9. $$\sum_{{\scriptstyle l\le i\le p \atop \scriptstyle l\le j\le q} 
\atop \scriptstyle l\le k\le r} a_{ij} b_{jk} c_{ki}$$. 

17.10. $\displaystyle\biggl ({\partial~2\over\partial x~2}+ 
{\partial"2\over\partial y"2}\biggr) \bigl I \varphi (x+iy)\bigr I "2=0$. 

17.11. Formulas that are more than one line tall are usually two lines tall, not l| or 
2 1 lines tall. 

17.12. $\bigl(x+f (x)\bigr) \big/ \bigl (x-f (x) \bigr) $. (Notice especially the 
‘\big/’; an ordinary slash would look too small between the \big parentheses. 

17.13. $$\pi (n)=\sum_{k=2}“n\left\lf loor\phi (k) \over k-l\right\rfloor . $$ 

17.14. $$\pi (n)=\sum_{m=2}“n\left\lf loor\biggl (\sum_{k=l}~{m-l}\bigl 
\lfloor (m/k) \big/\lceil m/k\rceil\bigr\rf loor\biggr) ~{-l}\right\rfloor . $$ 

17.15. A displayed formula equivalent to $${D}{{T}\over{T}"{{S}"{SS}}}$$. 

17.16. \def\sqr#l#2{{\vcenter{\vbox{\hrule height. #2pt 

\hbox{\vrule width. #2pt height#lpt \kern#lpt 
\vrule width. #2pt} 

\hrule height . #2pt}}}} 

\def\square{\mathchoice\sqr34\sqr34\sqr{2 . l}3\sqr{l . 5}3} 

17.17. \def\euler{\atopwithdelims<>}. 

17.18. The \textfont0 that was current at the beginning of the formula will be 
used, because this redefinition is local to the braces. (It would be a different story if 
‘\global\textfont’ had appeared instead; that would have changed the meaning of 
\textfont0 at all levels.) 

17.19. "2208 and "220F. 

17.20. \mathchardef\alpha="710B. Incidentally, {\rm\alpha} will then give a spu- 
rious result, because character position "0B of roman fonts does not contain an alpha; 
you should warn your users about what characters they are allowed to type under the 
influence of special conventions like \rm. 

17.21. If \delcode‘{ were set to some nonnegative delimiter code, you would get no 
error message when you wrote something like ‘\left{’. This would be bad because 
strange effects would happen when certain subformulas were given as arguments to 
macros, or when they appeared in alignments. But it has an even worse defect, because 
a user who gets away with ‘\left{’ is likely to try also ‘\bigl{’, which fails miserably. 

17.22. Since \bigl is defined as a macro with one parameter, it gets just Adelimiter’ 
as the argument. You have to write l \bigl{\delimiter"426830A}’ to make this work. 
On the other hand, \left will balk if the following character is a left brace. Therefore 
it’s best to have control sequence names for all delimiters. 
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18.1. $R(n, t) =0 (t"{n/2})$ , as $t\to0~+$. (N.B.: ‘0 (’, not ‘0 ( ’ .) 

18.2. $$p_l (n)=\lim_{m\to\infty}\sum_{\nu=0}"\infty 

\bigl (l-\cos~{2m} (\nu ! ~n\pi/n) \bigr) . $$ 

[Mathematicians may enjoy interpreting this formula; cf. G. H. Hardy, Messenger of 
Mathematics 35 (1906), 145-146.] 

18.3. \def\limsup{\mathop{\overline{\rm lim}}} 

\def\liminf {\mathop{\underline{\rm lim}}} 

[Notice that the limits ‘n — >■ oo’ appear at different levels, in both of the displays, 
because ‘sup’ and the underbar descend below the baseline. It is possible to unify the 
limit positions by using phantoms, as explained later in this chapter. For example, 

\def\limsup{\mathop{\vphantom{\underline{}}\overline{\rm lim}}} 

would give lower limits in the same position as \liminf.] 

18.4. x = 0( (mod y) n ). He should have typed l $x\equivO\pmod{y"n}$’. 

18.5. $${n\choose k}\equiv{\lf loor n/p\rf loor\choose 

\lfloor k/p\rfloor}{n\bmod p\choose k\bmod p}\pmod p.$$ 

18.6. $\bf\bar x"{\rm T}Mx={\rmO}\iff x=0$. (If you typed a space between 
\rm and 0, you wasted a keystroke; but don’t feel guilty about it.) 

18.7. $S\subseteq{\mit\Sigma}\iff S\in{\cal S}$. In this case the braces are 
redundant and could be eliminated; but you shouldn’t try to do everything with fewest 
keystrokes, or you’ll outsmart yourself some day. 

18.8. $${\it available}+\sum_{i=l}"n\max\bigl ({\it full}(i), 

{\it reserved} (i)\bigr)={\it capacity}. $$ 

[If \it had been used throughout the formula, the subscript i and superscript n would 
have caused error messages saying ‘\scriptfont 4 is undefined’, since plain Tj?X 
makes \it available only in text size.] 

18.9. {\obeylines \sf code ‘ ; =3000 

{\bf for $j:=2$ step $1$ until $n$ do} 

\quad {\bf begin} ${\it accum} : =A [j] $ ; $k:=j-l$; $A[0]:=\it accum$; 
\quad {\bf while $A[k]>\it accum$ do} 

\qquad {\bf begin} $A [k+1] : =A [k] $ ; $k:=k-l$; 

\qquad {\bf end} ; 

\quad $A[k+l]:=\it accum$; 

\quad {\bf end} . \par} 

[This is something like the “poetry” example in Chapter 14, but much more difficult. 
Some manuals of style say that punctuation should inherit the font of the preceding 
character, so that three kinds of semicolons should be typeset; e.g., these experts 
recommend ‘k := j — 1; H[0] := accum; end;’. The author heartily disagrees.] 

18.10. Let $H$~be a Hilbert space, \ $C$~a closed bounded convex subset 
of~$H$, \ $T$~a nonexpansive self map of~$C$. Suppose that as $n\to\infty$ , 
\ $a_{n,k}\to0$ for each~$k$, and $\gamma_n=\sum_{k=0}“\infty (a_{n,k+l}- 
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a_{n,k}) ~+\toO$. Then for each $x$~in~$C$, \ $A_nx=\sum_{k=0}"\infty 

a_{n,k}T~kx$ converges weakly to a fixed point of~$T$. 

[If any mathematicians are reading this, they might either appreciate or re- 
sent the following attempt to edit the given paragraph into a more acceptable style: 
“Let C be a closed, bounded, convex subset of a Hilbert space H, and let T be a non- 
expansive self map of C. Suppose that as n — > oo, we have a n ^ — > 0 for each k, 
and 7 „ = 5Z£L 0 (an,*+i — a n,k) + — > 0. Then for each x in C, the infinite sum 
A n x = XltLo a n,kT k x converges weakly to a fixed point of T.”] 

18.11. $$\int_0~\infty{t-ib\over t"2+b~2}e"{iat}\,dt= 

e~{ab}E_l (ab) , \qquad a,b>0.$$ 

18.12. $$\hbar=l . 0545\timesl0~{-27}\rm\ ,erg\ , sec . $$ 

18.13. There are ten atoms (the first is / and last is y 2 ); their types, and the inter- 
atomic spacing, are respectively 

Ol d Open Ord Punct \ , Ord Close \ ; Rel \ ; Ol d \> Bin \> Ord. 

18.14. $\left] -\infty ,T\right [\times\left] -\infty ,T\right [$. (Or one could 
say \mathopen and Vnathclose instead of \left and \right; then TjrX would not 
choose the size of the delimiters, nor would it consider the subformulas to be of type 
Inner.) Open intervals are more clearly expressed in print by using parentheses instead 
of reversed brackets; for example, compare ■(— oo ,T) x (— oo,T)’ to the given formula. 

18.15. The first + will become a Bin atom, the second an Ord; hence the result is x, 
medium space, +, medium space, -f, no space, 1 . 

18.16. $x_l+x_lx_2+\cdots+x_lx_2\ldots x_n$ and 

$ (x_l , \ldots ,x_n)\cdot (y _ 1 ,\ldots ,y_n) =x_ly_l+\cdots+x_ny_n$. 

18.17. The commas belong to the sentence, not to the formula; his decision to put 
them into math mode meant that Tf-\X didn’t put large enough spaces after them. Also, 
his formula ‘i = 1,2, , n' allows no breaks between lines, except after the =, so lie’s 
risking overfull box problems. But suppose the sentence had been more terse: 

Clearly a, < bj (i = 1, 2, . . . , n). 

Then his idea would be basically correct: 

Clearly $a_i<b_i$ \ ($i=l ,2 ,\ldots ,n$) . 

18.18. ... never\footnote*{Well \dots, hardly ever.} have ... 

18.19. Neither formula will be broken between lines, but the thick spaces in the 
second formula will be set to their natural width while the thick spaces in the first 
formula will retain their stretchability. 

18.20. Set \relpenalty=10000 and \binoppenalty=10000. And you also need to 
change the definitions of \bmod and \pmod, which insert their own penalties. 
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18.21. $\bigl\{\ ,x“3\bigm| h(x) \in\{-l , 0 , +1\}\ , \bigr\}$. 
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18.22. $\{\,p\mid p$~and $p+2$ are prime$\,\}$, assuming that \mathsurround 
is zero. The more difficult alternative ‘$\{\,p\mid p\ {\rm and}\ p+2\rm\ are\ 
prime\,\}$’ is not a solution, because line breaks do not occur at \ u (or at glue of any 
kind) within math formulas. Of course it may be best to display a formula like this, 
instead of breaking it between lines. 

18.23. $$f (x)=\cases{l/3&if $0\le x\lel$;\cr 2/3&if $3\le x\le4$;\cr 
O&elsewhere . \cr}$$ 
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18.24. $$\left\lgroup\matrix{a&b&c\cr d&e&f \cr}\right\rgroup 
\left\lgroup\matrix{u&x\cr v&y\cr w&z\cr}\right\rgroup$$. 

18.25. \pmatrix{y_l\cr \vdots\cr y_k\cr}. 

18.26. \def \undertext#l{$\underline{\smash{\hbox{#l}}}$} will underline the 
words and cross through the descenders; or you could insert \vphantom{y} before 
the \hbox, thereby lowering all of the underlines to a position below all descenders. 
Neither of these gives exactly what is wanted. (See also \underbar in Appendix B.) 
Underlining is actually not very common in fine typography, since font changes usually 
work just as well or better, when you want to emphasize something. If you really want 
underlined text, it’s best to have a special font in which all the letters are underlined. 

18.27. $n~{\rm th}$ root. (Incidentally, it is also acceptable to type ‘$n$th’, 
getting ‘nth’, in such situations; the fact that the n is in italics distinguishes it from 
the suffix. Typed manuscripts generally render this with a hyphen, but ‘n-tlT is frowned 
on nowadays when an italic n is available.) 

18.28. ${\bf S~{\rm-l}TS=dg} (\omega_l ,\ldots , \omega_n) =\bf \Lambda$. (Did 
you notice the difference between \omega (uj) and w (w)?) 

18.29. $\Pr(\,m=n\mid m+n=3\,)$. (Analogous to a set.) 

18.30. $\sinl8~\circ={l\over4} (\sqrt5-l)$. 

18.31. $k=l . 38\timesl0“{-16}\rm\ ,erg/~\circ K$. 

18.32. $\bar\Phi\subset NL_1~*/N=\bar L_l~* 

\subseteq\cdots\subseteq NL_n~*/N=\bar L_n~*$. 

18.33. $1 (\lambda) =\int\ !\ !\int_Dg(x,y)e~{i\lambda h(x,y)}\,dx\,dy$. 
(Although three \!’s work out best between consecutive integral signs in displays, the 
text style seems to want only two.) 

18.34. $\int_0~l\ ! \cdots\int_0~lf (x_l ,\ldots ,x_n) \ ,dx_l\ldots\ ,dx_n$. 

18.35. $$x_{2m}\equiv\cases{Q(X_m"2-P_2W_m"2)-2S“2ft($m$ odd)\cr 

\noalign{\vskip2pt} '/, spread the lines apart a little 
P_2"2 (X_m*2-P_2W_m"2) -2S"2& ($m$ even) \cr}\pmod N.$$ 


18.36. $$ (l+x_lz+x_l~2z~2+\cdots\ , )\ldots (l+x_nz+x_n~2z“2+\cdots\ , ) 
={l\over(l-x_lz)\ldots(l-x_nz)}.$$ (Notice the uses of \,.) 
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18.37. $$\prod_{ j\geO}\biggl (\sum_{k\geO}a_{ jk}z~k\biggr) bigggl 

=\sum_{n\geO}z~n\,\Biggl(\sum_ bigggr 

atopwitndelims 

{\scriptstyle k_0,k_l,\ldots\ge0\atop apostrophes 

\scriptstyle k_0+k_l+\cdots=n} mathop 

a_{0k_0}a_{lk_l}\ldots\ ,\Biggr) . $$ sumprime 


Some people would prefer to have the latter parentheses larger; but \left and \right 
come out a bit too large in this case. It’s not difficult to define \bigggl and \bigggr 
macros, analogous to the definitions of \biggl and \biggr in Appendix B. 

18.38. $${ (n_l+n_2+\cdots+n_m) ! \over n_l ! \ ,n_2 ! \ldots n_m!} 

={n_l+n_2\choose n_2}{n_l+n_2+n_3\choose n_3} 
\ldots{n_l+n_2+\cdots+n_m\ choose n_m} . $$ 

18.39. $$\def\\#l#2{ (l-q~{#l_#2+n})} '/, to save typing 

\Pi_R{a_l ,a_2 , \ldots ; a_M\atopwithdelims []b_l ,b_2 , \ldots ,b_N} 
=\prod_{n=0}~R{\\al\\a2\ldots\\aM\over\\bl\\b2\ldots\\bM} . $$ 

18.40. $$\sum_{p\rm\ ;prime}f (p)=\int_{t>l}f (t)\ ,d\pi (t) . $$ 

18.41. $$\{\underbrace{\overbrace{\mathstrut a,\ldots,a} 

~{k\ ; a\mathchar ‘ ’ \rm s} , 

\overbrace{\mathstrut b , \ldot s , b} 

~{1\ ;b\mathchar ‘ ’\rm s}}_{k+l\rm\ ; elements}\} . $$ 

Notice how apostrophes (instead of primes) were obtained. 

18.42. $$\pmatrix{\pmatrix{a&b\cr c&d\cr}& 

\pmatrix{e&f\cr g&h\cr}\cr 
\noalign{\smallskip} 

0&\pmatrix{i&j\cr k&l\cr}\cr} . $$ 

18.43. $$\det\left I \ , \matrix{ 

c_0&c_l\hf ill&c_2\hf ill&\ldots&c_n\hf ill\cr 
c_lftc_2\hf ill&c_3\hf ill&\ldots&c_{n+l}\hf ill\cr 
c_2&c_3\hf ill&c_4\hf ill&\ldots&c_{n+2}\hf ill\cr 
\ , \vdots\hf ill&\ , \vdots\hf illft 

\ ,\vdots\hf illftft\ ,\vdots\hf ill\cr 
c_n&c_{n+l}\hf illftc_{n+2}\hf ill&\ldots&c_{2n}\hf ill\cr 
}\right | >0. $$ 

18.44. $$\mathop{{\sum} ’ }_{x\in A}f (x) \mathrel{\mathop=*{\rm def}} 

\sum_{\scriptstyle x\in A\atop\scriptstyle x\ne0}f (x) . $$ 

This works because {\sum} is type Ord (so its superscript is not set above), but 
\mathop{{\sum} ’} is type Op (so its subscript is set below). The limits are centered 
on however, not on If you don’t like that, the remedy is more difficult; one 
solution is to use \sumprime_{x\in A} where \sumprime is defined as follows: 

\def\sumprime_#l{\setboxO=\hbox{$\scriptstyle{#l}$} 

\ setbox2=\hbox{$\di splay sty le{\sum}$} 

\setbox4=\hbox{${} ’ \mathsurround=Opt$} 

\dimen0=. 5\wd0 \advance\dimenO by-.5\wd2 
\ifdim\dimen0>0pt 
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\ifdim\dimen0>\wd4 \kern\wd4 \else\kern\dimenO\f i\f i 
\mathop{{\sum} ’ }_{\kern-\wd4 #1}} 

18.45. $$2\uparrow\uparrow k\mathrel{\mathop=~{\rm def}} 

2“{2~{2"{\cdot~{\cdot~{\cdot"2}}}}} 

\vbox{\hbox{$\Big\}\scriptstyle k$}\kernOpt} . $$ 

18.46. If you have to do a lot of commutative diagrams, you will want to define some 
macros like those in the first, few lines of this solution. The \matrix macro resets 
the baselines to \normalbaselines, because other commands like \openup might have 
changed them, so we redefine \normalbaselines in this solution. Some of the things 
shown here haven’t been explained yet, but Chapter 22 will reveal all. 

$$\def\normalbaselines{\baselineskip20pt 
\lineskip3pt \lineskiplimit3pt } 

\def\mapright#l{\ smashf 

\mathop{\longrightarrow}\limits~{#l}}} 

\def\mapdown#l{\Big\downarrow 

\rlap{$\vcenter{\hbox{$\scriptstyle#l$}}$}} 

\matrix{&&&&&&0\cr 

&&&&&&\mapdown{}\cr 

0&\mapr ight {}&{ \ c al 0} _C&\mapr ight \ i ot a& 

\cal E&\mapright\rho&\cal L&\mapright{}&0\cr 
&&\Big\Vert&&\mapdown\phi&&\mapdown\psi\cr 
0&\mapright{}&{\cal 0}_C&\mapright{}& 

\pi_*!\cal 0}_D&\mapright\delta& 

R~lf _*{\cal 0}_V(-D)&\mapright{}&0\cr 
&M&&&\mapdown{\theta_i\otimes\gamma“{-l}}\cr 
&&&&&&\hidewidth R"lf_*\bigl({\cal 0} 

_V (-iM)\bigr)\otimes\gamma~{-l}\hidewidth\cr 
&&&&&&\mapdown{}\cr 
&&&&&&0\cr}$$ 

19.1. $$\sum_{n=0}~\infty a_nz~n\qquad\hbox{converges if}\qquad 

I z | <\Bigl (\limsup_{n\to\infty}\root n\ ! \of { I a_n| }\ , \Bigr) “{-1} . $$ 

$${f (x+\Delta x)-f (x)\over\Delta x}\to f’(x) 

\qquad\hbox{as $\Delta x\to0$.}$$ 

$$\ I u_i\ | =1 ,\qquad u_i\cdot u_j=0\quad\hbox{if $i\ne j$.}$$ 

$$\it\hbox{The confluent image of}\quad\left\{ 

\matrix{\hbox{an arc}\hf ill\cr\hbox{a circle}\hf ill\cr 
\hbox{a fan}\hf ill\cr} 

\right\}\quad\hbox{is}\quad\left\{ 

\matrix{\hbox{an arc}\hf ill\cr 

\hbox{an arc or a circle}\hf ill\cr 
\hbox{a fan or an arc}\hf ill\cr}\right\} . $$ 

The first example includes \ ! and \ , to give slightly refined spacing; but the point of 
the problem was to illustrate the hbox, not to fuss over such extra details. The last 
example can be done much more simply using the ideas of Chapter 22, if you don’t 
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mind descending to the level of Tj?X primitives; for example, the first matrix could be 
replaced by 

\ ,\vcenter{\halign{#\hf il\cr an arc\cr a circle\cr a fan\cr}}\, 
and the second is similar. 

19.2. $$\textstyle y={l\over2}x$$. (Switching to text style is especially common 
in multiline formulas. For example, you will probably find occasions to use \textstyle 
on both sides of the &’s within an \eqalign.) 

19.3. The latter formula will be in text style, not display style. And even if you do 
type ‘$$\hbox{$\displaystyle{{formula) }$}$$’, the results are not quite the same, as 
we will see later: TpX will compress the glue in ‘$$(formula)$$’ if the formula is too 
wide to fit on a line at its natural width, but the glue inside \hbox{. . .} is frozen at 
its natural width. The \hbox version also invokes \everymath. 

19.4. One solution is to put the formula in an libox that occupies a full line: 

$$\leftline{\ indent $\di splay style 

l-{l\over2}+{l\over3}-{l\over4}+\cdots=\ln2$}$$ 

But this takes a bit of typing. If you make the definitions 

\def\leftdi splay# l$${\leftline{\ indent $\di splay sty le{#l}$}$$} 

\everydisplay{\leftdi splay} 

you can type ‘$$ (formula) $$’ as usual, and the formatting will be inserted automat- 
ically. (This doesn’t work with equation numbers; Appendix D illustrates how to 
handle them as well. ) 

19.5. $$\prod_{k\geO}{l\over (l-q"kz) }= 

\sum_{n\geO}z"n\bigg/\ ! \ ! \prod_{l\le k\le n} (l-q"k) . \eqno (16 ’ ) $$ 

19.6. \eqno\hbox{ (3 — 1)}. 

19.7. When you type an asterisk in math mode, plain TJjX considers * to be a 
binary operation. In the cases “(*)’ and ‘ (**)’, the binary operations are converted to 
type Ord, because they don’t appear in a binary context; but the middle asterisk in 
1 (***)’ remains of type Bin. So the result was ‘(* * *)’. To avoid the extra medium 
spaces, you can type ‘\eqno (*{*}*) ’; or you can change \mathcode‘*, if you never use 
* as a binary operation. 

19.8. Assuming that \hsize is less than 10000 pt, the natural width of this equation 
will be too large to fit on a line; also, \quad specifies glue at the left. Therefore ‘x = y' 
will appear exactly 1 em from the left, and ‘(5)’ will appear flush right. (The widths 
will satisfy w = z — q, d = 0, k = q — e = 18 mu.) In the case of \leqno, ‘(5)’ will 
appear flush left, followed by one quad of space in \textfont2, followed by one quad 
of space in the current text font, followed by ‘x = y\ 

19.9. (Note in particular that the final comes before the final ‘\cr’.) 

$$\eqalign{T(n) \le T(2“{\lceil\lg n\rceil}) 

&\le c (3“{\lceil\lg n\rceil}-2~{\lceil\lg n\rceil})\cr 
&<3c\cdot3~{\lg n}\cr 
&=3c\ , n“{\lg3} . \cr}$$ 
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19.10. $$\eqalign{P(x)&=a_0+a_lx+a_2x"2+\cdots+a_nx~n, \cr 

P (-x)ft=a_0-a_lx+a_2x~2-\cdots+(-l) “na_nx~n. \cr}\eqno (30)$$ 

19.11. Both sides of that equation are considered to be on the left, so you get results 

that look like this: . . 

f a = f(z) ] 

{ P = f(z 2 ) } ■ 

1 7 = /(* 3 ) J 

19.12. $$\leqalignno{\gcd(u, v)&=\gcd(v,u) ;&(9)\cr 

\gcd(u,v)ft=\gcd(-u,v) . ft(10)\cr}$$ 

19.13. $$\eqalignno{\biggl (\int_{-\infty}~\infty e"{-x~2}\ ,dx\biggr) "2 

&=\int_{-\infty}“\infty\int_{-\infty}~\infty 
e~{- (x~2+y~2)}\,dx\,dy\cr 

ft=\int_0~{2\pi}\int_0“\infty e~{-r"2}r\ ,dr\ ,d\theta\cr 
&=\int_0“{2\pi}\biggl (-{e“{-r'2}\over2} 

\bigg | _{r=0}“{r=\infty}\ , \biggr) \ , d\theta\cr 
ft=\pi .ft (11) \cr}$$ 

19.14. You get the displayed box 

x = y + z 

and 

2 2,2 

x = y + z . 

Reason: The ‘and’ occurs at the left of the \eqalign box, not at the left of the whole 
display, and the \eqalign box is centered as usual. 

19.15. By raising the equation number, he increased the line height, so T)gX put extra 
space between that line and the previous line when it calculated the inter-line glue. If 
he had said ‘\smash{\raise . . he wouldn’t have had that problem. 

19.16. $$\displaylines{\hf ill x\equiv x;\hfill\llap{(l)}\cr 

\hf ill\hbox{if}\quad x\equiv y\quad\hbox{then}\quad 
y\equiv x; \hfill\llap{ (2)}\cr 
\hf ill\hbox{if}\quad x\equiv y\quad\hbox{and}\quad 
yXequiv z\quad\hbox{then}\quad 
xXequiv z . \hf ill\llap{ (3)}\cr}$$ 

There’s also a trickier solution, which begins with 

$$\displaylines{x\equiv x; \hf il\llap{ (1) }\hf ilnegXcr 

19.17. $$\eqalignno{x_nu_l+\cdots+x_{n+t-l}u_t 

ft=x_nu_ 1+ (ax_n+c) u_2+\cdots\cr 

&\qquad+\bigl (a~{t-l}x_n+c (a"{t-2}+\cdots+l) \bigr)u_t\cr 
ft=(u_l+au_2+\cdots+a~{t-l}u_t)x_n+h(u_l , \ldots ,u_t) . 

\quad& (47 ) \cr}$$ 

You weren’t expected to insert the ‘Xquad’ on the last line; such refinements usually 
can’t be anticipated until you see the first proofs. But without that Xquad the ‘(47)’ 
would occur half a quad closer to the formula. 
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$$\displaylines{\quad\sum_{l\le j\le n}{l\over 

(x_j-x_l)\ldots(x_j-x_{j-l}) (x-x_j) (x_j-x_{j+l}) 

\ldots (x_ j -x_n) }\hf ill\cr 

\hf ill={l\over (x-x_l) \ldots (x-x_n)} . \ quad (27) \cr}$$ 

$$\def\\#l ; { (#1 ;q"2)_\infty} / to save typing 
\di splay sty le{q“{{l\over2}n(n+l) }\\ea; \\eq/a; \qquad\atop 
\hf ill\\caq/e ; \\cq"2\ ! /ae ; } 

\over (e ;q)_\infty (cq/ e ; q) _\infty$$ 

\def\mustnt{I must not talk in class. \par} 

\def\f ive{\mustnt\mustnt\mustnt\mustnt\mustnt} 

\def\twenty{\f ive\f ive\f ive\f ive} 

\def\punishment{\twenty\twenty\twenty\twenty\twenty} 

Solutions to more complicated problems of this type are discussed later. 

20.2. ABCAB. (The first \a expands into A\def\a{B. . this redefines \a, so the 
second \a expands into B. . ., etc.) At least, that’s what happens if \puzzle is en- 
countered when TjjX is building a list. But if \puzzle is expanded in an \edef or 
\message or something like that, we will see later that the interior \def commands are 
not performed while the expansion is taking place, and the control sequences following 
\def are expanded; so the result is an infinite string 

A\def A\def A\def A\def A\def A\def A\def A\def A... 

which causes TjjX to abort because the program’s input stack is finite. This example 
points out that a control sequence (e.g., \b) need not be defined when it appears in the 
replacement text of a definition. The example also shows that TjgX doesn’t expand a 
macro until it needs to. 

20.3. (xi, . . . ,x n ). Note that the subscripts are bold here, because the expansion 
(\bf x_l,\Mots,\bf x _ n ) doesn’t “turnoff” \bf. To prevent this, one should write 
\row{{\bf x}}; or (better), \row\xbold, in conjunction with \def \xbold{{\bf x}}. 

20.4. The catch is that the parameters have to percolate down to the \mustnt macro, 
if you extend the previous answer: 

\def\mustnt#l#2{I must not #1 in #2.\par} 

\def\f ive#l#2{\mustnt{#l}{#2} . . . \mustnt{#l}{#2}} 

\def\twenty#l#2{\f ive{#l}{#2} . . . \f ive{#l}{#2}} 
\def\punishment#l#2{\twenty{#l}{#2} . . . \twenty{#l}{#2}} 

When you pass parameters from one macro to another in this way, you need to enclose 
them in braces as shown. But actually this particular solution punishes Tj^X much 
more than it needs to, because it takes a lot of time to copy the parameters and read 
them again and again. There’s a much more efficient way to do the job, by defining 
control sequences: 

\def\mustnt{I must not \doit\ in \thatplace . \par} 
\def\punishment#l#2{\def\doit{#l}\def \thatplace{#2}'/, 
\twenty\twenty\twenty\twenty\twenty} 
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and by defining \five and \twenty without parameters as before. You can also delve 
more deeply into TjtXnicalities, constructing solutions that are more efficient yet; TjtX 
works even faster when macros communicate with each other via boxes. For example, 

\def\mustnt{\copyO } 

\def\punishment#l#2{\setbox0= 

\vbox{\strut I must not #1 in #2 . \strut}'/, 
\twenty\twenty\twenty\twenty\twenty} 

sets 100 identical paragraphs at high speed, because TpX has to process the paragraph 
and break it into lines only once. It’s much faster to copy a box than to build it up from 
scratch. (The struts in this example keep the interbaseline distances correct between 
boxed paragraphs, as explained in Chapter 12. Two struts are used, for if the message 
takes more than one line there will be a strut at both top and bottom. If it were known 
that each sentence will occupy only a single line, no struts would be needed, because 
interline glue is added as usual when a box created by \copy is appended to the current 
vertical list.) 

20.5. The ## feature is indispensable when the replacement text of a definition 
contains other definitions. For example, consider 

\def\a#l{\def\b##l{##l#l}} 

after which ‘\a!’ will expand to ‘\def\b#l{#l ! }’. We will see later that ## is also 
important for alignments; see, for example, the definition of \matrix in Appendix B. 

20.6. \def\a#{\b}. 

20.7. Let’s go slowly on this one, so that the answer will give enough background 
to answer all similar questions. The (parameter text) of the definition consists of 
the three tokens #1, #2, [i; the (replacement text) consists of the six tokens {i, #e, 
] 2 , ! 6 , #2, [i. (When two tokens of category 6 occur in the replacement text, the 
character code of the second one survives; the character code of a category-6 character 
is otherwise irrelevant. Thus, ‘\def\ ! #1 ! 2# [{##] ! ! #2] ’ would produce an essentially 
identical definition.) When expanding the given token list, argument #1 is in, since 
it is undelimited. Argument #2 is delimited by Ci , which is different from -C i , so it is 
set provisionally to { [y] ] ; but the outer “braces” are stripped off, so #2 reduces to the 
three tokens [i, yn, ] 2 - The result of the expansion is therefore 

fi #6 f 2 !e Ci yn 1 2 [l Zn > 2 . 

Incidentally, if you display this with \tracingmacros=l, says 

\ ! ! 1#2 [->{##] ! ! #2 [ 

#l<-x 
#2<- [y] 

Category codes are not shown, but a character of category 6 always appears twice in 
succession. A parameter token in the replacement text uses the character code of the 
final parameter in the parameter text. 

20.8. Yes indeed. In the first case, \a receives the meaning of \b that is current at 
the time of the \let. In the second case, \a becomes a macro that will expand into the 
token \b whenever \a is used, so it has the meaning of \b that is current at the time 
of use. You need Met, if you want to interchange the meanings of \a and \b. 
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20.9. (a) Yes. (b) No; any other control sequence can appear (except those declared 
as \outer macros). 

20.10. \def\overpaid{{\countO=\balance 

You have overpaid your tax by \dollaramount . 

\ifnum\count0<100 It is our policy to refund 
such a small amount only if you ask for it. 

\else A check for this amount is being mailed 
under separate cover. \fi}} 

20.11. The tricky part is to get the zero in an amount like ‘$2.0T. 

\def\dollaramount{\count2=\count0 \divide\count2 bylOO 
\$\number\count2 . '/, 

\multiply\count2 by-100 \advance\count2 by\count0 
\ifnum \count2<10 0\fi 
\number\count2 } 

20.12. \def\category#l{\if case\catcode ‘ #1 

escape\or begingroup\or endgroup\or math\or 
align\or endline\or parameter\or superscript\or 
subscript\or ignored\or space\or letter\or 
otherchar\or active\or comment\or invalid\fi} 

20.13. (a,b) True. (c,d) False. (e,f) True. In case (e), the (true text) starts with 
‘ue’. (g) The \ifx is false and the inner \if is true; so the outer \if becomes ‘\if 
True. . . which is false. (Interestingly, TJ^X knows that the outer \if is false even 
before it has looked at the \fi’s that close the \ifx and the inner \if.) 

20.14. One idea is to say 

\let\save=\c \let\c=0 \edef\a{\b\c\d} \let\c=\save 

because control sequences equivalent to characters are not expandable. However, this 
doesn’t expand occurrences of \c that might be present in the expansions of \b and \d. 
Another way, which is free of this defect, is 

\edef \next#l#2{\def #l{\b#2\d}} \next\a\c 

(and it’s worth a close look!). 

20.15. \toks0={\c} \toks2=\expandafter{\d} 

\edef \a{\b\the\toks0 \the\toks2 } 

(Notice that \expandafter expands the token after the left brace here.) 

20.16. The following shouldn’t be taken too seriously, but it does work: 
{\setboxO=\vbox{\halign{#{\c\span\d}\cr 

\let\next=0\edef \next#l{\gdef \next{\b#l}}\next\cr}}} 
\let\a=\next 

20.17. Neither one, although \a will behave like an unmatched left brace when it is 
expanded. The definition of \b is not complete, because it expands to ‘\def\b{{}’; 
T)gX will continue to read ahead, looking for another right brace, possibly discovering a 
runaway definition! It’s impossible to define a macro that has unmatched braces. But 
you can say \let\a={; Appendix D discusses several other brace tricks. 


expandafter 

span 

brace tricks 



Appendix A: Answers to All the Exercises 331 


endlinechar 
endgraf 
long 

nointerlineskip 
downbracefill 

\def\stripspace#l \next{#l} 

\edef \myname{\expandafter\stripspace\myname\next} 

The latter solution doesn’t work if the user types 7.’ at the end of his or her name, or 
if the name contains control sequences. 

20.19. Here are two solutions: 

\def\next#l\endname{\uppercase{\def \MYNAME{#1}}} 

\expandafter\next\myname\endname 

\edef \next{\def \noexpand\MYNAME{\myname}} 

\uppercase\expandafter{\next} 

20.20. (Here’s a solution that also numbers the lines, so that the number of repetitions 
is easily verifiable. The only tricky part about this answer is the use of \endgraf , which 
is a substitute for \par because \loop is not a \long macro.) 

\newcount\n 

\def\punishment#l#2{\n=0 

\loop\ifnum\n<#2 \advance\n byl 

\item{\number\n. }#l\endgraf\repeat} 

21.1. The interline skip is added for vboxes, but not for rules; he forgot to say 
\nointerlineskip, before and after the \moveright construction. 

21.2. \vrule height3pt depth-2pt widthlin. Notice that it was necessary to call 
it a \vrule since it appeared in horizontal mode. 

21.3. \def\boxit#l{\vbox{\hrule\hbox{\vrule\kern3pt 

\vbox{\kern3pt#l\kern3pt}\kern3pt\vrule}\hrule}} 

(The resulting box does not have the baseline of the original one; you have to work a 
little bit harder to get that.) 

21.4. \leaders: two boxes starting at 100 pt, 110 pt. 

\cleaders: three boxes starting at 95 pt, 105 pt, 115 pt. 

\xleaders: three boxes starting at 93 pt, 105 pt, 117 pt. 

21.5. \def\leaderf ill{\kern-0 . 3em\leaders\hbox to lem{\hss . \hss}'/, 

\hskip0.6em pluslfill \kern-0.3em } 

21.6. Since no height or depth specification follows the \vrule, the height and 
depth are i.e., the rule extends to the smallest enclosing box. This usually makes 
a heavy black band, which is too horrible to demonstrate here. However, it does work 
in the \downbracef ill macro of Appendix B; and \leaders\vrule\vf ill works fine 
in vertical mode. 


20.18. One way is to redefine \catcode ‘\“M=9 (ignored) just before the \read, so 
that the (return) will be ignored. Another solution is to redefine \endlinechar=-l, 
so that no character is put at the end of the line. Or you could try to be tricky by 
stripping off the space with macro expansion as follows: 
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21 . 7 . For example, say 

\null\nobreak\leaders\hrule\hskiplOpt pluslfilll\ \par 

The ‘V provides extra glue that is wiped out. by the implied \unskip at the end of 
every paragraph (see Chapter 14), and the ‘\null\nobreak’ makes sure that the leaders 
do not disappear at a line break; ‘filll’ overtakes the \parfillskip glue. 

21.8. $$\hbox to 2 . 5in{\cleaders 

\vbox to . 5in{\cleaders\hbox{\TeX}\vf il}\hf il}$$ 

21 . 9 . We assume that a strut is 12 pt tall, and that 50 lines fit on a page: 
\setboxO=\hbox{\strut I must not talk in class.} 

\null\cleaders\copy0\vskip600pt\vf ill\eject '/, 50 times on page 1; 
\null\cleaders\box0\vskip600pt\bye / 50 more on page 2. 

The \null keeps glue (and leaders) from disappearing at the top of the page. 

21 . 10 . {\let\the=0\edef \next{\write\cont{{token list)}}\next} will expand ev- 
erything but \the when the \write command is given. 

22 . 1 . Notice the uses of ‘\smallskip’ here to separate the table heading and footing 
from the table itself; such refinements are often worthwhile. 

\settabs\+\indent&10\fracl/2 lbs . \qquad&\it Servings\qquad&\cr 
\+&\negthinspace\it Weight&\it Servings^ 

{\it Approximate Cooking Time\/}*\cr 
\smallskip 

\+&8 lbs.&6&l hour and 50 to 55 minutes\cr 
\+&9 lbs.&7 to 8&About 2 hours\cr 

\+&9\fracl/2 lbs.&8 to 9&2 hours and 10 to 15 minutes\cr 
\+&10\fracl/2 lbs.&9 to 10&2 hours and 15 to 20 minutes\cr 
\smallskip 

\+&* For a stuffed goose, 

add 20 to 40 minutes to the times given. \cr 

The title line specifies ‘\it’ three times, because each entry between tabs is treated 
as a group by IfeX; you would get error messages galore if you tried to say something 
like ‘\+&{\it Weight&Servingsft. . . }\cr’. The ‘\negthinspace’ in the title line is a 
small backspace that compensates for the slant, in an italic W ; the author inserted this 
somewhat unusual correction after seeing how the table looked without it, on the first 
proofs. (You weren’t, supposed to think of this, but it has to be mentioned.) See 
exercise 11.6 for the ‘\frac’ macro; it’s better to say ‘ i/a’ than ‘M, in a. cookbook. 

Another way to treat this table would be to display it in a vbox, instead of 
including a. first column whose sole purpose is to specify indentation. 

22 . 2 . In such programs it seems best, to type \cleartabs just before &, whenever 
it is desirable to reset the old tabs. Multiletter identifiers look best, when set in text 
italics with \it, as explained in Chapter 18. Thus, the following is recommended: 

\+\bf while $p>0$ do\cr 

\+\quad\cleartabs&{\bf begin} $q:={\it link}(p)$; 

${\it free\_node} (p)$ ; $p:=q$;\cr 
\+&{\bf end};\cr 
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22.3. Here we retain the idea that & inserts a new tab, when there are no tabs to the 
right of the current position. Only one of the macros that are used to process \+ lines 
needs to be changed; but (unfortunately) it’s the most complex one: 

\def\t@bb@x{\if @cr\egroup '/, now \boxO holds the column 
\else\hss\egroup \dimen@=0\p@ 

\dimen@ii=\wdO \advance\dimen<3ii bylsp 
\loop\ifdim \dimen@<\dimen@ii 

\global\setbox\tabsyet=\hbox{\unhbox\tabsyet 

\global\setboxl=\lastbox}*/, 

\ifvoidl \advance\dimen@ii by-\dimen@ 

\advance\dimen@ii by-lsp \global\setboxl 
=\hbox to\dimen@ii{}\dimen@ii=-lpt\f i 
\advance\dimen@ by\wdl \global\setbox\tabsdone 
=\hbox{\boxl\unhbox\tabsdone}\repeat 
\setboxO=\hbox to\dimen@{\unhboxO}\f i 
\boxO} 

22.4. Horizontal lists Chapter 14 

Vertical lists Chapter 15 

Math lists Chapter 17 (i.e., the first column would be right-justified) 

22.5. Fowl&Poule de 1 ’ Ann\ ’ee&10 to 12&0ver 3&Stew, Fricassee\cr 

22.6. $$\halign to\hsize{\sl#\hf il\tabskip= . 5em plus.5em& 

#\hf il\tabskip=Opt plus.5em& 

\hf il#\tabskip=lem plus2em& 

\sl#\hf il\tabskip=. 5em plus.5em& 

#\hf il\tabskip=Opt plus.5em& 

\hf il#\tabskip=Opt\cr ...}$$ 

22.7. The trick is to define a new macro for the preamble: 

$$\def\welshverb#l={{\bf#l} = } 

\halign to\hsize{\welshverb#\hf il\tabskip=lem pluslemfe 
\welshverb#\hf il&\welshverb#\hf il\tabskip=Opt\cr . . .}$$ 

22.8. \hfil#: &\vtop{\parindent=0pt\hsize=16em 

\hangindent . 5em\strut#\strut}\cr 

With such narrow measure and such long words, the \tolerance should probably also 
have been increased to, say, 1000 inside the \vtop; luckily it turned out that a higher 
tolerance wasn’t needed. 

Note: The stated preamble solves the problem and demonstrates that T^X’s 
line-breaking capability can be used within tables. But this particular table is not 
really a good example of the use of \halign, because Tj?X could typeset it directly, 
using \everypar in an appropriate manner to set up the hanging indentation, and using 
\par instead of \cr. For example, one could say 

\hsize20em \parindentOpt \clubpenaltylOOOO \widowpenaltylOOOO 
\def\history#l&{\hangindent4. 5em 

\hbox to4em{\hss#l : }\ignorespaces} 

\everypar={\history} \def\\{\leavevmode{\it c\/}} 


tolerance 

vtop 

strut 

everypar 
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which spares TjrX all the work of \halign but yields essentially the same result. 

22.9. The equation is divided into separate parts for terms and plus/minus signs, 
and tabskip glue is used to center it: 

$$\openupl\ jot \tabskip=Opt pluslfil 
\halign to\displaywidth{\tabskip=Opt 
$\hf il#$ft$\hf il{}#{}$& 

$\hf il#$&$\hf il{}#{}$& 

$\hf il#$&$\hf il{}#{}$& 

$\hf il#$&${}#\hf il$\tabskip=Opt plus If ilft 
\llap{#}\tabskip=Opt\cr 
10w&+&3x&+&3y&+&18z&=l , & (9) \cr 
6w&-&17x&&&-&5z&=2 . & ( 10) \cr}$$ 

\hf il# &#\hfil&\quad#&\ \hfil#&\ \hfil#\cr 

\pmatrix{a_{ll}&a_{12}&\ldots&a_{ln}\cr 
a_{21}&a_{22}&\ldots&a_{2n}\cr 
\multispan4\dotf ill\cr 
a_{ml}&a_{m2}&\ldots&a_{mn}\cr} 

‘\cr’ would have omitted the final column, which is a vertical rule. 

One way is to include two lines just before and after the title line, saying 
‘\omit&height2pt&\multispan5&\cr’. Another way is to put \bigstrut into some 
column of the title line, for some appropriate invisible box \bigstrut of width zero. 
Either way makes the table look better. 

22.14. The trick is to have “empty” columns at the extreme left and right; then the 
Xhrulefill’s are able to span the tabskip glue. 

$$\vbox{\tabskip=Opt \off inter line skip 
\halign to 36em{\tabskip=0pt pluslem#& 

#\hf il&#&#\hf il&#&#\hf il&#\tabskip=Opt\cr 
&&&&&\strut J. H. BV'ohning, 1838ft\cr 
&&&&\multispan3\hrulef ill\cr 
&&&\strut M. J. H. BV'ohning, 1882&\vrule\cr 
&&\multispan3\hrulef ill\cr 

&&\vrule&&\vrule&\strut M. D. Blase, 1840&\cr 
&&\vrule&&\multispan3\hrulef ill\cr 
&\strut L. M. Bohning, 1912&\vrule\cr 
\multispan3\hrulef ill\cr 
&&\vrule&ft&\strut E. F. Ehlert , 1845&\cr 
&&\vrule&&\multispan3\hrulef ill\cr 
&ft\vrule&\strut P. A. M. Ehlert, 1884&\vrule\cr 
&&\multispan3\hrulef ill\cr 

&&&&\vrule&\strut C. L. Wischmeyer, 1850&\cr 
&M&\multispan3\hrulef ill\cr}}$$ 

22.oo. (Solution to Dudeney’s problem.) Let \one and \two be macros that produce 
a vertical list denoting one and two pennies, respectively. The problem can be solved 


22 . 10 . 

22 . 11 . 

22 . 12 . 

22.13. 


leave vmode 
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valign 

interline glue 

nobreak 

insertions 


23.2. \headline={\ifnum\pageno=l \hss\tenbf R\ ’ESUM\ ’E\hss 

\else\tenrm RVesumVe of A. U. Thor \dotfill\ Page \folio\fi} 

(You should also say \nopagenumbers and \voffset=2\baselineskip.) 

23.3. \output={\plainoutput\blankpageoutput} 
\def\blankpageoutput{\shipout\vbox{\makeheadline 

\vbox to\vsize{}\makefootline}\advancepageno} 

23.4. Set \hsize=2 . lin, allocate ‘\newbox\midcolmmT, and use the following code: 
\output={\if L\lr 

\global\setbox\leftcolumn=\coluranbox \global\let\lr=M 
\else\if M\lr 

\global\setbox\midcoluran=\columnbox \global\let\lr=R 
\else \tripleformat \global\let\lr=L\f i\f i 
\ifnum\outputpenalty>-20000 \else\dosupere ject\f i} 
\def\tripleformat{\shipout\vbox{\makeheadline 

\fullline{\box\leftcolumn\hf il\box\midcolumn\hf il\columnbox} 
\makef ootl ine} 

\advancepageno} 

At the end, \supereject and say ‘\if L\lr \else\null\vf ill\eject\fi’ twice. 

23.5. He forgot that interline glue is inserted automatically before the \leftline; 
this permits a legal breakpoint between the \mark and the \leftline box, according 
to the rules of page breaking in Chapter 15. One cure would be to say \nobreak just 
after the \mark; but it’s usually best to put marks and insertions just after boxes. 

23.6. Say, for example, \if case2\expandafter\relax\botmark\f i to read part a.i 
of \botmark. Another solution puts the five components into five parameters of a 
macro, analogous to the method used by \inxcheck later in this chapter; but the 
\ifcase approach is usually more efficient, because it lets IjgX pass over the unselected 
components at high speed. 

23.7. \output={\dimen0=\dp255 \normaloutput 

\if odd\pageno\else\if L\lr 

\expandafter\inxcheck\botmark\sub\end\f i\f i} 

In this case the \normaloutput macro should be the two-column output routine that 
was described earlier in this chapter, beginning with ‘\if L\lr’ and ending with 
‘\let\lr=L\f i’. (There is no need to test for \supere ject.) 


with \valign as follows: 

\valign{\vfil#&\vfil#&\vfil#&\vfil#\cr 

\two&\one&\one&\one\cr 

\one&\one&\two&\one\cr 

\one&\one&\one&\two\cr 

\one&\two&\one&\one\cr} 

Since \valign transposes rows and columns, the result is 


QOQQ 
Q Q Q Q 
Q Q Q Q 
Q Q Q Q 


23.1. \footline={\hss\tenrm — \folio\ — \hss} 
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23.8. False. If the text of the main and/or subsidiary entry is lengthy, a continuation 
line may actually become two or more lines. (Incidentally, hanging indentation will 
then occur, because the \everypar command — which was set up outside the \output 
routine — is effective inside.) The \vsize must be large enough to accommodate all 
continuation lines plus at least one more line of index material, or else infinite looping 
will occur. 

24.1. If \cs has been defined by \chardef or \mathchardef , T^rX uses hexadecimal 
notation when it expands \meaning\cs, and it assigns category 12 to each digit of 
the expansion. You might have an application in which you want the last part of the 
expansion to be treated as a (number). (This is admittedly an obscure reason.) 

24.2. Yes; any number of spaces can precede any keyword. 

24.3. The first two have the same meaning; but the third coerces \baselineskip to 
a (dimen) by suppressing the stretchability and shrinkability that might be present. 

24.4. The natural width is 221 dd (which Tf-'X rounds to 15497423 sp and displays 
as 236.47191pt). The stretchability is 2500 sp, since an internal integer is coerced to a 
dimension when it appears as an (internal unit). The shrinkability is zero. Notice that 
the final \space is swallowed up as part of the optional spaces of the (shrink) part in 
the syntax for (glue). (If PLUS had been MINUS, the final \space would not have been 
part of this (glue)!) 

24.5. If it was non-null when a \dump operation occurred. Here’s a nontrivial exam- 
ple, which sets up \batchmode and puts \end at the end of the input file: 

\everyjob={\batchmode\input\jobname\end} 

24.6. (a) \def \\#l\\{}\futurelet\cs\\ u \\. (b) \def \\{\let\cs= }\\ u . (There 
are many other solutions.) 

24.7. (internal quantity) — t (internal integer) | (internal dimen) 

| (internal glue) | (internal niuglue) | (internal nonnumeric) 

(internal nonnumeric) — > (token variable) | (font) 

26.1. Radix 10 notation is used for numeric constants and for the output of numeric 
data. The first 10 \count registers are displayed at each \shipout, and their values 
are recorded on the dvi file at such times. A box whose glue has stretched or shrunk to 
its stated stretchability or shrinkability has badness 100; this badness value separates 
“loose” boxes from “very loose” or “underfull” ones. will scroll up to 100 errors in a 
single paragraph before giving up (see Chapter 27). The normal values of \spacef actor 
and \mag are 1000. A \prevdepth value of — 1000 pt suppresses interline glue. The 
badness rating of a box is at most 10000, except that the \badness of an overfull box 
is 1000000. INITEX initializes \tolerance to 10000, thereby making all line breaks 
feasible. Penalties of 10000 or more prohibit breaks; penalties of —10000 or less make 
breaks mandatory. The cost of a page break is 100000, if the badness is 10000 and if 
the associated penalties are less than 10000 in magnitude (see Chapter 15). 

26.2. T)7X allows constants to be expressed in radix 8 (octal) or radix 16 (hexadeci- 
mal) notation, and it uses hexadecimal notation to display \char and \mathchar codes. 
There are 16 families for math fonts, 16 input streams for \read, 16 output streams for 
\write. A \catcode value must be less than 16. The notation **?, “A specifies 


chardef 
mathchardef 
hexadecimal notation 
meaning 

coerce jnumber^ to jdimen 

spaces 

dump 

jobname 

batchmode 

end 
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characters whose codes differ by 64 from the codes of ?, Q, A; this convention applies 
only to characters with ASCII codes less than 128. There are 256 possible characters, 
hence 256 entries in each of the \catcode, \mathcode, \lccode, \uccode, \sf code, and 
\delcode tables. All \lccode, \uccode, and \char values must be less than 256. A 
font has at most 256 characters. There are 256 \box registers, 256 \ count registers, 
256 \dimen registers, 256 \skip registers, 256 \muskip registers, 256 \toks registers, 
256 hyphenation tables. The “at size” of a font must be less than 2048 pt, i.e. , 2 11 pt. 
Math delimiters are encoded by multiplying the math code of the “small character” 
by 2 12 . The magnitude of a (dimen) value must be less than 16384 pt, i.e., 2 14 pt; simi- 
larly, the (factor) in a (fil dimen) must be less than 2 14 . A \mathchar or \spacef actor 
or \sfcode value must be less than 2 la ; a \mathcode or \mag value must be less than 
or equal to 2 10 , and 2 10 denotes an “active” math character. There are 2 16 sp per pt. 
A \delcode value must be less than 2 24 ; a \delimiter, less than 2 27 . The \end com- 
mand sometimes contributes a penalty of — 2 30 to the current page. A (dimen) must be 
less than 2 30 sp in absolute value; a (number) must be less than 2 31 in absolute value. 

27.1. He forgot to count the space; TjrpC deleted ‘i’, ‘m’, ‘ u \ ‘\input’, and four letters. 
(But all is not lost; he can type ‘T or ‘2’, then (return), and after being prompted by 

he can enter a new line of input.) 

27.2. First delete the unwanted tokens, then insert what you want: Type ‘6’ and 
then ‘I\macro’. (Incidentally, there’s a sneaky way to get at the \inaccessible control 
sequence by typing 

I\garbage{}\let\accessible= 

in response to an error message like this. The author designed TjrjX in such a way that 
you can’t destroy anything by playing such nasty tricks.) 

27.3. T u ’/.’ does the trick, if '/, is a comment character. 

27.4. The ‘minus’ of ‘minuscule’ was treated as part of the \hskip command in 
\nextnumber. Quick should put ‘\relax’ at the end of his macro. (The keywords 1, 
plus, minus, width, depth, or height might just happen to occur in text when TJgX is 
reading a glue specification or a rule specification; designers of general-purpose macros 
should guard against this. If you get a ‘Missing number’ error and you can’t guess 
why Tj?X is looking for a number, plant the instruction ‘\tracingcommands=l’ shortly 
before the error point; your log file will show what command TjtX is working on.) 

27.5. If this exercise isn’t just a joke, the title of this appendix is a lie. 


If you can't solve a problem, 
you can always look up the answer. 
But please, try first to solve it by yourself; 
then you’ll learn more and you'll learn faster. 
— DONALD E. KNUTH, The T E XbOOk (1983) 
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Knuth 

comment character 

minus 

keywords 

1 

plus 

width 

depth 

height 

Missing number 
KNUTH 

SHAKESPEARE 


How answer you for your selues? 
— WILLIAM SHAKESPEARE, Much Adoe About Nothing (1598) 
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Let’s begin this appendix with a chart that summarizes plain Tf^X’s conventions. 


Characters that are reserved for special purposes: \ { } $ & # "/, 


\rm roman, 

{\sl slanted}, 

{\bf boldface}, 

{\it italic\/} type 

roman, 

slanted, 


boldface, 

italic type 


n ) ) __ 

?‘ ! ‘ \$ \# 

\k 

\°/ t \ae \AE \oe 

\0E \aa \AA \ss 

\o \0 

U 55 

l i $ # 

& 

% S I (E 

(E a A fi 

0 0 

Va Ve \~o 

\"u \=y \~n 

\.p 

\u\i \v s \H\ j 

\t\i u \b k \c c 

\d h 

a e 6 

ii y n 

P 

1 S j 

iu k q 

h 

\1 \L \dag 

\ddag \S 

\P 

{\it\$ \&} \copyright \TeX 

\dots 

1 L f 

t § 

If 

£ & 

© TeX 


Line break controls: \break 

\nobreak \allowbreak \hbox{unbreakable} 

dis\-cre\-tion\-ary hy\-phens 

vi 

rgule\slash breakpoint 

Breakable horizontal spaces: 


Unbreakable horizontal spaces: 


\ u normal interword space 


normal interword space 


\enskip this 

much 


\enspace 

this much 


\quad this 

much 


\thinspace 

this much 


\qquad this 

much 


\negthinspace thianuch 


\hskip (arbitrary dimen) 


\kern (arbitrary dimen) 



summary of plain TeX 
+ 

tabbing 

accents 


Vertical spaces: \smallskip = \medskip \bigskip 


Page break controls: \eject \supereject \nobreak \goodbreak \filbreak 
Vertical spaces and good breakpoints: \smallbreak \medbreak \bigbreak 


\settabs 4 \columns 

\+Here’s an example&of \hf ill some &tabbing:&\hrulef ill&\cr 

Here’s an example of some tabbing: 

\hrulefill \dotfill 

\leftarrowf ill t \rightarrowf ill > 

\upbracefill v - \downbracef ill , ^ s 

More general alignments use \halign, \valign, \omit, \span, and \multispan. 


Examples of the principal conventions for text layout appear on the next page. 
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7, This test file generates the output shown on the opposite page. 

7, It’s a bit complex because it tries to illustrate lots of stuff. 
7, TeX ignores commentary (like this) that follows a sign. 

7 , First the standard output style is changed slightly: 

\hsize=29pc 7. The lines in this book are 29 picas wide. 

\vsize=42pc 7. The page body is 42 picas (not counting footlines) . 
\f ootline={\tenrm Footline\quad\dotf ill\quad Page \folio} 
\pageno=1009 7 , This is the starting page number (don’t ask why). 

7 See Chapter 23 for the way to make other page format changes via 
7 \hoffset, \voffset, \nopagenumbers , \headline, or \raggedbottom. 

\topglue lin 7. This makes an inch of blank space (lin=2.54cm) . 
\centerline{\bf A Bold, Centered Title} 

\smallskip 7. This puts a little extra space after the title line. 
\rightline{\it avec un sous-titre \‘a la fran\c caise} 

7, Now we use \beginsection to introduce part 1 of the document. 
\beginsection 1. Plain \TeX nology 7. The next line must be blank! 

The first paragraph of a new section is not indented. 

\TeX\ recognizes the end of a paragraph when it comes to a blank 
line in your manuscript file. 7. or to a ‘\par’: see below. 

Subsequent paragraphs {\it are\/} indented. \f ootnote*{The amount 
of indentation can be changed by changing a parameter called 
{\tt\char ‘ Wparindent} . Turn the page for a summary of \TeX’s most 
important parameters.} (See?) The computer breaks a paragraph’s 

text into lines in an interesting way see ref erence” [1] and h7 0 

yphenates words automatically when necessary. 

\midinsert 7. This begins inserted material, e.g., a figure. 
\narrower\narrower 7. This brings the margins in (see Chapter 14) . 
\noindent \llap{‘‘}If there hadn’t been room for this material on 
the present page, it would have been inserted on the next one.’’ 
\endinsert 7. This ends the insertion and the effect of \narrower. 

\proclaim Theorem T. The typesetting of $math$ is discussed in 
Chapters 16 — 19, and math symbols are summarized in Appendix~F. 

\beginsection 2. Bibliography\par 7. ‘\par’ acts like a blank line. 
\frenchspacing 7. (Chapter 12 recommends this for bibliographies.) 
\item{[l]} D.~E. Knuth and M.~F. Plass, ‘ ‘Breaking paragraphs 
into lines,’’ {\sl Softw. pract . exp. \bfll} (1981), 1119 — 1184. 
\bye 7. This is the way the file ends, not with a \bang but a \bye. 


hsize 

vsize 

footline 

pageno 

topglue 

centerline 

smallskip 

rightline 

beginsection 

blank line 

TeX 

par 

footnote 

tt 

char 

percent 

midinsert 

noindent 

llap 

proclaim 

frenchspacing 

item 

Plass 

Knuth 

bye 
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A Bold, Centered Title 

avec un sous-titre a la frangaise 


1. Plain T^Xnology 

The first paragraph of a new section is not indented. TgX recognizes the end of 
a paragraph when it comes to a blank line in your manuscript file. 

Subsequent paragraphs are indented.* (See?) The computer breaks a para- 
graph’s text into lines in an interesting way — see reference [1] — and hyphenates 
words automatically when necessary. 

“If there hadn’t been room for this material on the present 
page, it would have been inserted on the next one.” 

Theorem T. The typesetting of math is discussed in Chapters 16-19, and math 
symbols are summarized in Appendix F. 

2. Bibliography 

[1] D. E. Knuth and M. F. Plass, “Breaking paragraphs into lines,” Softw. 
pract. exp. 11 (1981), 1119-1184. 


* The amount of indentation can be changed by changing a parameter called 
\parindent. Turn the page for a summary of TgX’s most important parameters. 


Footline 


Page 1009 
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The preceding example illustrates most of the basic things that you can 
do directly with plain TpjX . but it does not provide an exhaustive list. Thus, 
it uses \centerline and \rightline, but not \leftline or \line; it uses 
\midinsert, but not \topinsert or \pageinsert; it uses \smallskip, but not 
\medskip or \bigskip; it uses \llap but not \rlap, \item but not \itemitem, 
\topglue but not \hglue. It does not illustrate \raggedright setting of para- 
graphs; it does not use \obeylines or \obeyspaces to shut off TgX’s automatic 
formatting. All such control sequences are explained later in this appendix, and 
further information can be found by looking them up in the index. The main 
purpose of the example is to serve as a reminder of the repertoire of possibilities. 

Most of the control sequences used in the example are defined by macros 
of plain T^X format, but three of them are primitive, i.e., built in: ‘\par’ (end of 
paragraph), ‘\noindent’ (beginning of non-indented paragraph), and ‘\/’ (italic 
correction). The example also assigns values to two of T^X’s primitive param- 
eters, namely \hsize and \vsize. TgX has scores of parameters, all of which 
are listed in Chapter 24, but only a few of them are of special concern to the 
majority of TgX users. Here are examples of how you might want to give new 
values to the most important parameters other than \hsize and \vsize: 

\tolerance=500 (TgX will tolerate lines whose badness is rated 500 or less.) 
\looseness=l (The next paragraph will be one line longer than usual.) 
\parindent=4mm (Paragraphs will be indented by four millimeters.) 
\hoffset=1.5in (All output will be shifted right by one and a half inches.) 
\voff set=24pt (All output will be shifted down by 24 points.) 
\baselineskip=llpt plus. Ipt (Baselines will be 11 pt apart, or a bit more.) 
\parskip=3pt pluslpt minus . 5pt (Extra space will precede each paragraph.) 

Plain TgX uses \par indent also to control the amount of indentation provided 
by \item, \itemitem, and \narrower. 

f Tlie remainder of this appendix is devoted to the details of the plain TpX 
format, which is a set of macros that come with normal implementations of 
TpX. These macros serve three basic, purposes: (1) They make TjjX usable, because 
IpX’s primitive capabilities operate at a very low level. A “virgin” IjrX system that 
has no macros is like a newborn baby that has an immense amount to learn about the 
real world; but it is capable of learning fast. (2) The plain TjgX macros provide a basis 
for more elaborate and powerful formats tailored to individual tastes and applications. 
You can do a lot with plain TpX, but pretty soon you’ll want to do even more. (3) The 
macros also serve to illustrate how additional formats can be designed. 

Somewhere in your computer system you should be able to find a file called 
plain.tex that contains exactly what has been preloaded into the running TjrX system 
that you use. Our purpose in the rest of this appendix will be to discuss the contents 
of plain.tex. However, we will not include a verbatim description, because some 
parts of that file are too boring, and because the actual macros have been “optimized” 
with respect to memory space and running time. Unoptimized versions of the macros 
are easier for humans to understand, so we shall deal with those; plain.tex contains 
equivalent constructions that work better on a machine. 


obeylines 

obeyspaces 

as is, see obeylines, obeyspaces 

primitive 

parameters 

tolerance 

looseness 

parindent 

hoffset 

voffset 

baselineskip 

parskip 

item 

itemitem 

narrower 

plain.tex 

optimized 

efficiency 
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So here’s the plan for the rest of Appendix B: We shall go through the con- 
tents of plain.tex, interspersing an edited transcription of that file with comments 
about noteworthy details. When we come to macros whose usage has not yet been 
explained — for example, somehow \vglue and \beginsection never made it into Chap- 
ters 1 through 27 — we shall consider them from a user’s viewpoint; but most of the 
time we shall be addressing the issues from the standpoint of a macro designer. 

1. The code tables. A format’s first duty is to establish \catcode values. This is nec- 
essary because, for example, a \def command can’t be used until there are characters 
like { and } of categories 1 and 2. The INITEX program (which reads plain.tex so 
that TgX can be initialized) begins without knowing any grouping characters; hence 
plain.tex starts out as follows: 

'/, This is the plain TeX format that’s described in The TeXbook. 

'/, N.B.: A version number is defined at the very end of this file; 

/ please change that number whenever the file is modified! 

*/, And don’t modify the file under any circumstances. 

\catcode ‘\{=1 */. left brace is begin-group character 
\catcode ‘\}=2 */, right brace is end-group character 
\catcode ‘\$=3 */, dollar sign is math shift 
\catcode ‘\&=4 */, ampersand is alignment tab 
\catcode ‘\#=6 */, hash mark is macro parameter character 

\catcode ‘\~=7 \catcode ‘ \~~K=7 */. circumflex and uparrow for superscripts 
\catcode ‘\_=8 \catcode‘\~~A=8 '/, underline and downarrow for subscripts 
\catcode ‘\“I=10 ’/. ASCII tab is treated as a blank space 
\chardef \active=13 \catcode ‘ \~=\active */, tilde is active 

\catcode ‘\“L=\active \outer\def “L{\par} */, ASCII form-feed is \outer\par 
\message{Preloading the plain format: codes,} 

These instructions set up the nonstandard characters **K and “A for superscripts and 
subscripts, in addition to * and _, so that people with extended character sets can 
use t and 1 as recommended in Appendix C. Furthermore “I (ASCII (tab)) is given 
category 10 (space); and ~~L (ASCII (formfeed)) becomes an active character that 
will detect runaways on files that have been divided into “file pages” by (formfeed) 
characters. The control sequence \active is defined to yield the constant 13; this is 
the one category code that seems to deserve a symbolic name, in view of its frequent 
use in constructing special-purpose macros. 

When INITEX begins, category 12 (other) has been assigned to all 256 possible 
characters, except that the 52 letters A. . .Z and a. . .z are category 11 (letter), and a 
few other assignments equivalent to the following have been made: 

\catcode ‘\\ =0 \catcode‘\ =10 \catcode ‘V/. =14 
\catcode‘\~~@=9 \catcode ‘ \“M=5 \catcode ‘ \“?=15 

Thus ‘V is already an escape character, ‘ u ’ is a space, and 7,’ is available for comments 
on the first line of the file; ASCII (null) is ignored, ASCII (return) is an end-of-line 
character, and ASCII (delete) is invalid. 

The \message command shown above prints a progress report on the terminal 
when plain.tex is being input by INITEX. Later on conies ‘\message{registers , 
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catcode 

INITEX 

superscripts 

subscripts 

uparrow char 

downarrow char 

ASCII 

tab 

file pages 

formfeed 

active 

backslash 

space 

null 

return 

delete 

message 
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and several other messages, but we won’t mention them specifically. The terminal will 
eventually display something like this when initialization is complete: 

** plain 

(plain.tex Preloading the plain format: codes, registers, 
parameters, fonts, more fonts, macros, math definitions, 
output routines, hyphenation (hyphen.tex)) 

* \dump 

Beginning to dump on file plain. fmt 

followed by a variety of statistics about what fonts were loaded, etc. If you want to 
make a new format super.tex that adds more features to plain.tex, it’s best not 
to make a new file containing all the plain stuff, or even to \input plain; just type 
‘ftplain super’ in response to INITEX’s ** prompt, to input plain. fmt at high speed. 

After the opening \message, plain.tex goes on to define a control sequence 
\dospecials that lists all the characters whose catcodes should probably be changed 
to 12 (other) when copying things verbatim: 

\def \dospecials{\do\ \do\\\do\{\do\}\do\$\do\&'/, 
\do\#\do\~\do\“K\do\_\do\“~A\do\'/Ado\~} 

(Appendix E illustrates how to use \dospecials.) The ASCII codes for (null), (tab), 
(linefeed), (formfeed), (return), and (delete) have not been included in the list. 

At this point plain.tex completes its initialization of category codes by set- 
ting \catcode ‘\@=11, thereby making the character behave temporarily like a letter. 
The command \catcode ‘ \@=12 will appear later, hence at-sign characters will act just 
like ordinary punctuation marks when TpyX is running. The idea is to make it easy 
for plain TjhX to have private control sequences that cannot be redefined by ordinary 
users; all such control sequences will have at least one in their names. 

The next job is to set up the \mathcode table: 


\mathcode ‘ \ ~~@="2201 
\mathcode ‘ \““C="010C 
\mathcode ‘ \~ ~F="3232 
\mathcode ‘\"~I="010D 
\mathcode ‘ \~~L="2206 
\mathcode ‘ \~~0="0140 
\mathcode ‘ \~~R="225C 
\mathcode ‘ \* *U=" 0239 
\mathcode ‘ \~‘X="3220 
\mathcode‘\“ [="2205 
\mathcode ‘ \ ="3211 

\mathcode‘\ ="8000 
\mathcode ‘ \ (="4028 
\mathcode ‘ \+="202B 
\mathcode ‘ \ . ="013A 
\mathcode ‘ \ ; ="603B 
\mathcode ‘ \>="313E 
\mathcode ‘ \\="026E 
\mathcode ‘ \{="4266 


\mathcode ‘ \“A="3223 
\mathcode ‘ \““D="225E 
\mathcode ‘ \~~G="0119 
\mathcode ‘\""J="010E 
\mathcode ‘ \~~M="2208 
\mathcode ‘ \""P="321A 
\mathcode ‘ \~~S="225B 
\mathcode ‘ \“V="220A 
\mathcode ‘ \ “Y="3221 
\mathcode ‘ \“\="3214 
\mathcode ‘ \“_="225F 
\mathcode ‘ \ ! ="5021 
\mathcode ‘ \) ="5029 
\mathcode ‘ \ , ="613B 
\mathcode ‘ \/="013D 
\mathcode ‘ \<="313C 
\mathcode ‘ \?="503F 
\mathcode ‘ \] ="505D 
\mathcode ‘ \ I ="026A 


\mathcode ‘ \~~B="010B 
\mathcode ‘ \~~E="023A 
\mathcode ‘ \~~H="0115 
\mathcode ‘ \“~K="3222 
\mathcode ‘ \**N="0231 
\mathcode ‘ \~~Q="321B 
\mathcode ‘ \“T="0238 
\mathcode ‘ \“W="3224 
\mathcode ‘ \* *Z=" 8000 
\mathcode‘\“]="3215 
\mathcode ‘ \**?=" 1273 
\mathcode ‘ \ ’ ="8000 
\mathcode ‘ \*="2203 
\mathcode ‘ \-="2200 
\mathcode ‘ \ : ="303A 
\mathcode ‘ \=="303D 
\mathcode ‘ \ [="405B 
\mathcode ‘ \_="8000 
\mathcode ‘ \}="5267 
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A mathcode is relevant only when the corresponding category code is 11 or 12; therefore 
many of these codes will rarely be looked at. For example, the math code for ~*M 
specifies the character \oplus, but it’s hard to imagine a user who would want “M 
(ASCII (return)) to produce an © sign in the middle of a math formula, since plain 
T)gX appends “M to the end of every line of input. The math codes have been set up 
here, however, to be entirely consistent with the extended character set presented in 
Appendix C and the Computer Modern fonts described in Appendix F. INITEX has 
done the rest of the work, as far as matlicodes are concerned: It has set \mathcode x = 
x + "7000 for each of the ten digits x = ‘0 to ‘9; \mathcode x = x + "7100 for each of 
the 52 letters; and \mathcode x = x for all other values of x. 

There’s no need to change the \uccode and \lccode tables. INITEX has made 
\uccode‘X=‘X, \uccode‘x=‘X, \lccode ‘X=‘x, \lccode‘x=‘x, and it has made similar 
assignments for all other letters. The codes are zero for all nonletters. These tables are 
used by TjyX’s \uppercase and \lowercase operations, and the hyphenation algorithm 
also looks at \lccode (see Appendix H). Changes should be made only in format 
packages that set TgX up for languages with more than 26 letters (see Chapter 8). 

Next comes the \sfcode table, which INITEX has initialized entirely to 1000, 
except that \sf code ‘X=999 for each of the 26 uppercase letters. Some characters are 
made “transparent” by setting 

\sfcode ‘ \) =0 \sfcode‘\’=0 \sfcode‘\]=0 / won’t change the space factor 

and the \nonfrenchspacing macro will be used later to change the sfcodes of special 
punctuation marks. (Chapter 12 explains what an \sfcode does.) 

The last code table is called \delcode, and again it’s necessary to change only 
a few values. INITEX has made all delimiter codes equal to —1, which means that no 
characters are recognized as delimiters in formulas. But there’s an exception: The value 
\delcode‘\.=0 has been prespecified, so that stands for a “null delimiter.” (See 
Chapter 17.) Plain format sets up the following nine values, based on the delimiters 
available in Computer Modern: 

\delcode‘\ (=”028300 \delcode ‘\/=”02F30E \delcode ‘\)=”029301 

\delcode ‘\ [="05B302 \delcode ‘\ I ="26A30C \delcode ‘ \] ="05D303 

\delcode ‘\<="26830A \delcode ‘\\="26E30F \delcode ‘\>="26930B 

It’s important to note that \delcode‘\{ and \delcode‘\} have been left equal to — 1. 
If those codes were set to certain values, a user would be able to type, e.g., ‘\big{’ 
to get a big left brace; but it would be a big mistake. The reason is that braces are 
used for grouping, when supplying arguments to macros; all sorts of strange things can 
happen if you try to use them both as math delimiters and group delimiters. 

At this point the plain.tex file contains several definitions 

\chardef \@ne=l \chardef \tw@=2 \chardef\thr@@=3 \chardef \sixt@@n=16 
\chardef \@cclv=255 \mathchardef\@cclvi=256 

\mathchardef \@m= 1000 \mathchardef \@M= 10000 \mathchardef \@MM=20000 

which allow “private” control sequences \@ne, \tw@, etc., to be used as abbreviations 
for commonly used constants 1,2,...; this convention makes T)gX run a little faster, 
and it means that the macros will consume slightly less memory space. The usage 
of these abbreviations will not, however, be shown below unless necessary; we shall 
pretend that T u ’ appears instead of \@ne, T0000 u ’ instead of \@M, and so on, since 
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that makes the programs more readable. (Notice that the long form of \@ne is T u ’ 
including a space, because TgX looks for and removes a space following a constant.) 

2. Allocation of registers. The second major part of the plain.tex file provides a 
foundation on which systems of independently developed macros can coexist peacefully 
without interfering in their usage of registers. The idea is that macro writers should 
abide by the following ground rules: (1) Registers numbered 0 to 9 are always free for 
temporary “scratch” use, but their values are always assumed to be clobbered whenever 
any other macro might get into control. (This applies to registers like \dimenO, \toksO, 
\skipl, \box3, etc.; but TjgX has already reserved \countO through \count9 for page 
number identification.) (2) The registers \count255, \dimen255, and \skip255 are 
freely available in the same way. (3) All assignments to the scratch registers whose 
numbers are 1, 3, 5, 7, and 9 should be \global; all assignments to the other scratch 
registers (0, 2, 4, 6, 8, 255) should be nonAglobal. (This prevents the phenomenon of 
“save stack buildup” discussed in Chapter 27.) (4) Furthermore, it’s possible to use any 
register in a group, if you ensure that TjjX’s grouping mechanism will restore the register 
when you’re done with the group, and if you are certain that other macros will not 
make global assignments to that register when you need it. (5) But when a register is 
used by several macros, or over long spans of time, it should be allocated by \newcount, 
\newdimen, \newbox, etc. (6) Similar remarks apply to input/output streams used by 
\read and \write, to math families used by \fam, to sets of hyphenation rules used by 
\language, and to insertions (which require \box, \count, \dimen, and \skip registers 
all having the same number). 

Some handy abbreviations are introduced at this point so that the macros 
below will have easy access to scratch registers: 

\countdef\count@=255 \toksdef\toks@=0 \skipdef\skip@=0 

\dimendef\dimen@=0 \dimendef \dimen@i=l \dimendef \dimen@ii=2 

Here now are the macros that provide allocation for quantities of more per- 
manent value. These macros use registers \countlO through \count20 to hold the 
numbers that were allocated most recently; for example, if \newdimen has just reserved 
\dimenl5, the value of \countll will be 15. However, the rest of the world is not 
supposed to “know” that \ count 11 has anything to do with \dimen registers. There’s 
a special counter called \allocationnumber that will be equal to the most recently al- 
located number, after every \newcount, \newdimen, . . . , \newinsert operation; macro 
packages are supposed to refer to \allocationnumber if they want to find out what 
number was allocated. The inside story of how allocation is actually performed should 
be irrelevant when the allocation macros are used at a higher level; you mustn’t assume 
that plain.tex really does allocation in any particular way. 

\countl0=22 '/, this counter allocates \count registers 23, 24, 25, ... 

\countll=9 '/, this counter allocates \dimen registers 10, 11, 12, ... 

\countl2=9 '/, this counter allocates \skip registers 10, 11, 12, ... 

\countl3=9 '/, this counter allocates \muskip registers 10, 11, 12, ... 
\countl4=9 '/, this counter allocates \box registers 10, 11, 12, ... 

\countl5=9 '/, this counter allocates \toks registers 10, 11, 12, ... 

\countl6=-l '/, this counter allocates input streams 0, 1, 2, ... 

\countl7=-l '/, this counter allocates output streams 0, 1, 2, ... 

\countl8=3 '/, this counter allocates math families 4, 5, 6, ... 
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\countl9=0 '/, this counter allocates language codes 1, 2, 3, ... 
\count20=255 '/, this counter allocates insertions 254, 253, 252, ... 
\countdef\insc@unt=20 '/, nickname for the insertion counter 
\countdef\allocationnumber=21 */, the most recent allocation 
\countdef\m@ne=22 \m@ne=-l '/, a handy constant 

\def \wlog{\immediate\write-l} '/, this will write on log file (only) 
\outer\def \newcount{\alloc@0\count\countdef\insc@unt} 

\outer\def \newdimen{\alloc@l\dimen\dimendef\insc@unt} 

\outer\def \newskip{\alloc@2\skip\skipdef\insc@unt} 

\outer\def \newmuskip{\alloc@3\muskip\muskipdef \@cclvi} 

\outer\def \newbox{\alloc@4\box\chardef \insc@unt} 

\let\newtoks=\relax '/, this allows plain.tex to be read in twice 
\outer\def \newhelp#l#2{\newtoks#l#l=\expandafter{\csname#2\endcsname}} 
\outer\def \newtoks{\alloc@5\toks\toksdef\<3cclvi} 

\outer\def \newread{\alloc@6\read\chardef\sixt@@n} 

\outer\def \newwrite{\alloc@7\write\chardef \sixt@@n} 

\outer\def \newfam{\alloc@8\f am\chardef \sixt@@n} 

\outer\def \newlanguage{\alloc@9\language\chardef\@cclvi} 

\def \alloc@#l#2#3#4#5{\global\advance\countl#l by 1 
\ch@ck#l#4#2'/, make sure there’s still room 

\allocationnumber=\countl#l \global#3#5=\allocationnumber 
\wlog{\string#5=\string#2\the\allocationnumber}} 

\outer\def \newinsert#l{\global\advance\insc@unt by-1 
\ch@ckO\insc@unt\count \ch@ckl\insc@unt\dimen 
\ch@ck2\insc@unt\skip \ch@ck4\insc<3unt\box 
\allocationnumber=\ins cOunt 
\global\chardef#l=\allocationnumber 

\wlog{\string#l=\string\insert\the\allocationnumber}} 

\def \ch@ck#l#2#3{\ifnum\countl#l<#2'/, 

\else\errmessage{No room for a new #3}\fi} 

The ‘\alloc<§’ macro does most of the work of allocation. It puts a message like 
‘\maxdimen=\dimenlO’ into the log file after \newdimen has allocated a place for the 
\dimen register that will be called \maxdimen; such information might be useful when 
difficult macros are being debugged. 

A \newhelp macro has been provided to aid in creating home-made help texts: 
You can say, e.g., \newhelp\helpout{This is a help message.}, and then give the 
command ‘\errhelp=\helpout’ just before issuing an \errmessage. This method of 
creating help texts makes efficient use of TjjX’s memory, because it puts the text into 
a control sequence name where it doesn’t take up space that is needed for tokens. 

The plain file now goes ahead and allocates registers for important constants: 

\newdimen\maxdimen \maxdimen=16383 . 99999pt 
\newskip\hideskip \hideskip=-1000pt pluslfill 
\newskip\centering \centering=Opt plus lOOOpt minus lOOOpt 
\newdimen\p@ \p@=lpt */, this saves macro space and time 
\newdimen\z@ \z@=0pt \newskip\z@skip \z@skip=0pt plusOpt minusOpt 
\newbox\voidb@x '/, permanently void box register 
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The control sequence \maxdimen stands for the largest permissible (dimen). Alignment 
macros that appear below will make use of special glue values called \hideskip and 
\centering. N.B.: These three constants must not be changed under any circum- 
stances-, you should either ignore them completely or just use them and enjoy them. 
In fact, the next four constant registers (\p@, \z@, \z@skip, and \voidb@x) have been 
given private names so that they are untouchable. The control sequence \p@ is used 
several dozen times as an abbreviation for ‘pt and \z@ is used quite often to stand for 
either ‘Opt ’ or ‘0 the use of such abbreviations saves almost 10% of the space needed 
to store the tokens in plain TjrX’s macros. But we shall stick to the unabbreviated forms 
below, so that the macros are more readable. 

A different sort of allocation comes next: 

\outer\def \newif #l{\count@=\e scape char \escapechar=-l 
\expandafter\expandafter\expandafter 
\edef\@if#l{true}{\let\noexpand#l=\noexpand\iftrue}'/, 
\expandafter\expandafter\expandafter 
\edef\@if#l{false}{\let\noexpand#l=\noexpand\if false}'/, 

\<§if #l{f alse}\escapechar=\count@} / the condition starts out false 
\def \@if #l#2{\csname\expandafter\if@\string#l#2\endcsname} 

{\uccode ‘ 1=‘ i \uccode‘2=‘f \uppercase{\gdef\if @12{}}} '/, ‘if’ is required 

For example, the command \newif \ifalpha creates a trio of control sequences called 
\alphatrue, \alphafalse, and \ifalpha (see Chapter 20). 

3. Parameters. IMITEX sets almost all of the numeric registers and parameters equal 
to zero; it makes all of the token registers and parameters empty; and it makes all of 
the box registers void. But there are a few exceptions: \mag is set initially to 1000, 
\tolerance to 10000, \maxdeadcycles to 25, \hangafter to 1, \escapechar to ‘\\, 
and \endlinechar to Plain IjrpC assigns new parameter values as follows: 

\pretolerance=100 \tolerance=200 \hbadness=1000 \vbadness=1000 
\linepenalty=10 \hyphenpenalty=50 \exhyphenpenalty=50 
\binoppenalty=700 \relpenalty=500 

\clubpenalty=150 \widowpenalty=150 \displaywidowpenalty=50 
\brokenpenalty=100 \predisplaypenalty= 10000 

\doublehyphendemer it s= 10000 \f inalhyphendemerits=5000 \adjdemerits=10000 
\tracinglostchars=l \uchyph=l \delimiterf actor=901 
\defaulthyphenchar=‘\- \def aultskewchar=-l \newlinechar=-l 
\showboxbreadth=5 \showboxdepth=3 \errorcontextlines=5 
\hfuzz=0.1pt \vfuzz=0.1pt \overfullrule=5pt 
\hsize=6.5in \vsize=8.9in \parindent=20pt 

\maxdepth=4pt \splitmaxdepth=\maxdimen \boxmaxdepth=\maxdimen 
\delimitershortf all=5pt \nulldelimiterspace=l . 2pt \scriptspace=0. 5pt 
\parskip=0pt plus lpt 

\abovedisplayskip=12pt plus 3pt minus 9pt 
\abovedisplayshortskip=Opt plus 3pt 
\belowdisplayskip=12pt plus 3pt minus 9pt 
\belowdisplayshortskip=7pt plus 3pt minus 4pt 
\topskip=10pt \splittopskip=10pt 
\parf illskip=0pt plus lfil 
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\thinmuskip=3mu 

\medmuskip=4mu plus 2mu minus 4mu 
\thickmuskip=5mu plus 5mu 

(Some parameters are set by T^X itself as it runs, so it is inappropriate to ini- 
tialize them: \time, \day, \month, and \year are established at the beginning of a job; 
\outputpenalty is given a value when an \output routine is invoked; \predisplaysize, 
\displaywidth, and \displayindent get values just before a display is processed; and 
the values \looseness=0, \hangindent=Opt, \hangafter=l, \parshape=0 are assigned 
at the end of a paragraph and when TjrX enters internal vertical mode.) 

The parameters \baselineskip, \lineskip, and \lineskiplimit have not 
been initialized here, but a macro called \normalbaselines is defined below; this 
macro sets \baselineskip=\normalbaselineskip, \lineskip=\normallineskip, and 
\lineskiplimit=\normallineskiplimit. An indirect approach like this has been used 
so that several different type sizes may be handled, as illustrated in Appendix E. Plain 
TjgX deals exclusively with 10 pt type, but it supports extension to other styles. 

Some “pseudo parameters” come next. These quantities behave just like in- 
ternal parameters of TfrX, and users are free to change them in the same way, but they 
are part of the plain TpX format rather than primitives of the language. 

\newskip\smallskipamount '/, the amount of a \smallskip 
\smallskipamount=3pt pluslpt minuslpt 
\newskip\medskipamount '/, the amount of a \medskip 
\medskipamount=6pt plus2pt minus2pt 
\newskip\bigskipamount '/, the amount of a \bigskip 
\bigskipamount=12pt plus4pt minus4pt 
\newskip\normalbaselineskip '/, normal value of \baselineskip 
\normalbaselineskip=12pt 

\newskip\normallineskip */, normal value of \lineskip 
\normall ineskip= lpt 

\newdimen\normallineskiplimit '/, normal value of \lineskiplimit 
\normallineskiplimit=Opt 

\newdimen\ jot '/, unit of measure for opening up displays 
\jot=3pt 

\newcount\interdisplaylinepenalty '/, interline penalty in \displaylines 
\interdisplaylinepenalty=100 

\newcount\interf ootnotelinepenalty '/, interline penalty in footnotes 
\interfootnotelinepenalty=100 
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4- Font information. Now plain.tex brings in the data that TjrX needs to know about 
how to typeset lots of characters in lots of different fonts. First the \magstep macros 
are defined, to support font scaling: 

\def \magstephalf {1095 } 

\def \magstep#l{\if case#l 1000\or 

1200\or 1440\or 1728\or 2074\or 2488\f i\relax} 

(Incidentally, \magstep doesn’t use \multiply to compute values, since it is supposed to 
expand to a (number) enroute to TjgX’s “stomach”; \multiply wouldn’t work, because 
it is an assignment command, performed only in the stomach.) 
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One of the main things that distinguishes one format from another is the fact 
that each format gives TgX the necessary knowledge about a certain family of typefaces. 
In this case the Computer Modern types described in Appendix F are taken as a basis, 
although there is a provision for incorporating other styles. 


\f ont\tenrm=cmrlO 
\f ont\sevenrm=cmr7 

\f ont\teni=cmmilO 
\f ont\seveni=cmmi7 

\f ont\tensy=cmsylO 
\f ont\sevensy=cmsy7 

\f ont\tenex=cmexlO 

\f ont\tenbf=cmbxlO 
\f ont\sevenbf=cmbx7 

\f ont\tensl=cmsllO 
\f ont\tentt=cmttlO 
\f ont\tenit=cmtilO 
\f ont\preloaded=cmsslO 
\f ont\preloaded=cmssilO 


\font\preloaded=cmr9 

\font\preloaded=cmr6 

\font\preloaded=cmmi9 

\font\preloaded=cmmi6 

\font\preloaded=cmsy9 

\font\preloaded=cmsy6 

\font\preloaded=cmbx9 

\font\preloaded=cmbx6 

\font\preloaded=cmsl9 

\font\preloaded=cmtt9 

\font\preloaded=cmti9 

\font\preloaded=cmssq8 

\font\preloaded=cmssqi8 


\font\preloaded=cmr8 
\font\f iverm=cmr5 

\font\preloaded=cmmi8 
\font\f ivei=cmmi5 

\font\preloaded=cmsy8 
\font\f ivesy=cmsy5 

\font\preloaded=cmbx8 
\font\f ivebf=cmbx5 

\font\preloaded=cmsl8 

\font\preloaded=cmtt8 

\font\preloaded=cmti8 


\f ont\preloaded=cmr7 scaled \magstep4 '/, for titles 
\f ont\preloaded=cmttlO scaled \magstep2 
\f ont\preloaded=cmssbxlO scaled \magstep2 

'/, Additional \preloaded fonts can be specified here. 

'/, (And those that were \preloaded above can be eliminated.) 
\let\preloaded=\undef ined */, preloaded fonts must be declared anew later. 


Notice that most of the fonts have been called \preloaded; but the control sequence 
\preloaded is made undefined at the very end, so those fonts cannot be used directly. 
There are two reasons for this strange approach: First, it is desirable to keep the total 
number of fonts of plain TgX relatively small, because plain TgX is a sort of standard 
format; it shouldn’t cost much for someone to acquire all the fonts of plain l^gX in 
addition to those he really wants. Second, it is desirable on many computer systems to 
preload the information for most of the fonts that people will actually be using, since 
this saves a lot of machine time. The \preloaded font information goes into T^rX’s 
memory, where it will come alive instantly if the user defines the corresponding \font 
again. For example, the book format in Appendix E says l \font\ninerm=cmr9’; after 
that assignment lias been obeyed, the control sequence \ninerm will identify the cmr9 
font, whose information does not have to be loaded again. 

The exact number and nature of fonts that are preloaded is unimportant; the 
only important thing needed for standardization between machines is that sixteen basic 
fonts (cmrlO, cmr7, . . . , cmtilO) should actually be loaded. The plain.tex files used 
on different machines can be expected to differ widely with respect to preloaded fonts, 
since the choice of how many fonts to preload and the selection of the most important 
fonts depends on local conditions. For example, at the author’s university it is useful 
to preload a font that contains the Stanford seal, but that particular font is not very 
popular at Berkeley. 


Computer Modern 
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Most of these fonts have the default values of \hyphenchar and \skewchar, 
namely and -1; but the math italic and math symbol fonts have special \skewchar 
values, which are defined next: 

\skewchar\teni=’ 177 \skewchar\seveni= ’ 177 \skewchar\f ivei= ’ 177 
\skewchar\tensy=’60 \skewchar\sevensy=’60 \skewchar\fivesy=’60 

Once the fonts are loaded, they are also grouped into families for use in math 
setting, and shorthand names like \rm and \it are defined: 

\textf ontO=\tenrm \scriptf ontO=\sevenrm \scriptscriptfontO=\f iverm 
\def \rm{\f amO \tenrm} 

\textf ontl=\teni \scriptf ontl=\seveni \scriptscriptfontl=\f ivei 
\def \mit{\faml } \def\oldstyle{\faml \teni} 

\textf ont2=\tensy \scriptf ont2=\sevensy \scriptscriptfont2=\f ivesy 
\def \cal{\fam2 } 

\textf ont3=\tenex \scriptf ont3=\tenex \scriptscriptfont3=\tenex 

\newfam\itfam \def\it{\fam\itfam\tenit} \textfont\itf am=\tenit 
\newfam\slfam \def\sl{\fam\slfam\tensl} \textfont\slf am=\tensl 
\newfam\bffam \def\bf {\fam\bffam\tenbf} \textfont\bff am=\tenbf 
\scriptf ont\bffam=\sevenbf \scriptscriptf ont\bffam=\f ivebf 

\newfam\ttfam \def\tt{\fam\ttfam\tentt} \textfont\ttf am=\tentt 


5. Macros for text. The fifth part of plain.tex introduces macros that do basic for- 
matting unrelated to mathematics. First come some macros that were promised above: 

\def\frenchspacing{\sf code ‘\ . =1000 \sf code ‘\?=1000 \sf code ‘\ ! =1000 
\sf code ‘ \ : =1000 \sf code ‘ \ ; =1000 \sf code ‘ \ , =1000 } 

\def \nonfrenchspacing{\sf code ‘\ . =3000 \sf code ‘\?=3000 \sf code ‘ \ ! =3000 
\sf code ‘ \ : =2000 \sf code ‘ \ ; =1500 \sf code ‘ \ , =1250 } 

\def \normalbaselines{\line skip=\normallineskip 

\baselineskip=\normalbaselineskip \lineskiplimit=\normallineskiplimit} 

The next macros are simple but vital. First \{tab) and \ (return) are defined 
so that they expand to \ (space); this helps to prevent confusion, since all three cases 
look identical when displayed on most computer terminals. Then the macros \lq, 
\rq, \lbrack, and \rbrack are defined, for people who have difficulty typing quota- 
tion marks and/or brackets. The control sequences \endgraf and \endline are made 
equivalent to TJrX’s primitive \par and \cr operations, since it is often useful to rede- 
fine the meanings of \par and \cr themselves. Then come the definitions of \space 
(a blank space), \empty (a list of no tokens), and \null (an empty libox). Finally, 
\bgroup and \egroup are made to provide “implicit” grouping characters that turn 
out to be especially useful in macro definitions. (See Chapters 24-26 and Appendix D 
for information about implicit characters.) 

\def\**K\ } \def\“JK\ } 

\def\lq{‘} \def\rq{’} \def\lbrack{ [} \def \rbrack{] } 

\let\endgraf =\par \let\endline=\cr 
\def\space{ } \def\empty{} \def\null{\hbox{}} 

\let\bgroup={ \let\egroup=} 
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Something a bit tricky comes up now in the definitions of \obeyspaces and 
\obeylines, since TpX is only “half obedient” while these definitions are half finished: 

\def \obeyspaces{\catcode ‘\ =\active} 

{\obeyspaces\global\let =\space} 

{\catcode ‘ \“M=\active '/, these lines must end with 
\gdef \obey line s{\cat code ‘\~~M=\active \let“M=\par}'/, 

\global\let""M=\par} '/, this is in case “M appears in a \write 

The \obeylines macro says ‘\let~“M=\par’ instead of ‘\def ~~M{\par}’ because the 
Met technique allows constructions such as ‘\let\par=\cr \obeylines \halign{. . 
in which \cr’s need not be given within the alignment. 

The \loop. . .\repeat macro provides for iterative operations as illustrated at 
the end of Chapter 20. In this macro and several others, the control sequence ‘\next’ 
is given a temporary value that is not going to be needed later; thus, \next acts like a 
“scratch control sequence.” 

\def\loop#l\repeat{\def\body{#l}\ iterate} 

\def \iterate{\body \let\next=\iterate \else\let\next=\relax\f i \next} 
\let\repeat=\f i '/, this makes \loop. . . \if . . . \repeat skippable 

Spacing is the next concern. The macros \enskip, \quad, and \qquad provide 
spaces that are legitimate breakpoints within a paragraph; \enspace, \thinspace, 
and \negthinspace produce space that cannot cause a break (although the space will 
disappear if it occurs just next to certain kinds of breaks). All six of these spaces 
are relative to the current font. You can get horizontal space that never disappears 
by saying ‘\hglue(glue)’; this space is able to stretch or shrink. Similarly, there’s a 
vertical analog, I \vglue(glue)’. The \nointerlineskip macro suppresses interline glue 
that would ordinarily be inserted before the next box in vertical mode; this is a “one 
shot” macro, but \off interlineskip is more drastic — it sets things up so that future 
interline glue will be present, but zero. There also are macros for potentially breakable 
vertical spaces: \smallskip, \medskip, and \bigskip. 

\def \enskip{\hskip . 5em\relax} \def\enspace{\kern. 5em } 

\def \quad{\hskiplem\relax} \def\qquad{\hskip2em\relax} 

\def \thinspace{\kern .16667em } \def\negthinspace{\kern- . 16667em } 

\def \hglue{\afterassignment\hgl@\skip@=} 

\def \hgl@{\leavevmode \count@=\spacef actor \vrule widthOpt 
\nobreak\hskip\skip@ \spacef act or =\ count®} 

\def \vglue{\afterassignment\vgl@\skip@=} 

\def \vgl@{\par \dimen@=\prevdepth \hrule heightOpt 
\nobreak\vskip\skip@ \prevdepth=\dimen@} 

\def \topglue{\nointerlineskip \vglue-\topskip \vglue} '/, for top of page 

\def \nointerlineskip{\prevdepth=-1000pt } 

\def \off interlineskip{\baseline skip=-1000pt 
\lineskip=Opt \lineskiplimit=\maxdimen} 

\def \smallskip{\vskip\smallskipamount} 

\def \medskip{\vskip\medskipamount} 

\def \bigskip{\vskip\bigskipamount} 
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Speaking of breakpoints, the following macros introduce penalty markers that 
make breaking less, or more, desirable. The \break, \nobreak, and \allowbreak 
macros are intended for use in any mode; the ~ (tie) and \slash (hyphen-like ‘/’) 
macros are intended for horizontal mode. The others are intended only for vertical 
mode, i.e., between paragraphs, so they begin with \par. 

\def \break{\penalty- 10000 } \def \nobreak{\penaltyl0000 } 

\def \allowbreak{\penaltyO } 

\def ~{\penaltyl0000\ } 

\def \slash{/\penalty\exhyphenpenalty} 

\def \f ilbreak{\par\vf il\penalty-200\vf ilneg} 

\def \goodbreak{\par\penalty-500 } 

\def \e ject{\par\penalty-10000 } 

\def \supereject{\par\penalty-20000 } 

\def \removelastskip{\ifdim\lastskip=Opt \else\vskip-\lastskip\f i} 

\def \smallbreak{\par \ifdim\lastskip<\smallskipamount 
\removelastskip \penalty-50 \smallskip \fi} 

\def \medbreak{\par \ifdim\lastskip<\medskipamount 
\removelastskip \penalty-100 \medskip \fi} 

\def \bigbreak{\par \ifdim\lastskip<\bigskipamount 
\removelastskip \penalty-200 \bigskip \fi} 

Boxes are next: Mine, Meftline, \rightline, and \centerline produce 
boxes of the full line width, while \llap and \rlap make boxes whose effective width 
is zero. The \underbar macro puts its argument into an hbox with a straight line at a 
fixed distance under it. 

\def \line{\hbox to\hsize} 

\def Meftline# l{\line{#l\hss}} \def \rightline#l{\line{\hss#l}} 

\def \centerline#l{\line{\hss#l\hss}} 

\def \llap#l{\hbox to 0pt{\hss#l}} \def \rlap#l{\hbox to 0pt{#l\hss}} 

\def \m@th{\mathsurround=Opt } 

\def \underbar#l{$\setboxO=\hbox{#l} \dp0=0pt 
\m@th \underline{\boxO}$} 

(Notice that \underbar uses math mode to do its job, although the operation is essen- 
tially non-mathematical in nature. A few of the other macros below use math mode 
in similar ways; thus, TjrX’s mathematical abilities prove to be useful even when no 
mathematical typesetting is actually being done. A special control sequence \m@th is 
used to “turn off” \mathsurround when such constructions are being performed.) 

A \strut is implemented here as a rule of width zero, since this takes minimum 
space and time in applications where numerous struts are present. 

\newbox\ strutbox 

\setbox\strutbox=\hbox{\vrule height8.5pt depth3.5pt widthOpt} 

\def \strut{\relax\iframode\copy\strutbox\else\unhcopy\strutbox\f i} 

The ‘\relax’ in this macro and in others below is necessary in case \strut appears 
first in an alignment entry, because T^X is in a somewhat unpredictable mode at such 
times (see Chapter 22). 
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The Malign macro provides for alignments when it is necessary to be sure 
that \tabskip is initially zero. The \hidewidth macro can be used essentially as \hf ill 
in alignment entries that are permitted to “stick out” of their column. There’s also 
\multispan, which permits alignment entries to span one or more columns. 

\def \ialign{\everycr={}\tabskip=Opt \halign} '/, initialized \halign 
\def \hidewidth{\hskip\hide skip} 

\newcount\ms count 

\def \multispan#l{\omit \mscount=#l \loop\ifnum\mscount>l \sp@n\repeat} 

\def \sp@n{\span\omit \advance\mscount by -1 } 

Now we get to the “tabbing” macros, which are more complicated than any- 
thing else in plain Tj^X. They keep track of the tab positions by maintaining boxes 
full of empty boxes having the specified widths. (The best way to understand these 
macros is probably to watch them in action on simple examples, using \tracingall.) 

\newif\ifus@ \newif \if@cr 

\newbox\tabs \newbox\tabsyet \newbox\tabsdone 

\def \cleartabs{\global\setbox\tabsyet=\null \setbox\tabs=\null} 

\def \settabs{\setbox\tabs=\null \futurelet\next\sett@b} 

\let\+=\relax */, in case this file is being read in twice 
\def \sett@b{\ifx\next\+ \let\next=\relax */, turn off \outerness 
\def\next{\afterassignment\s@tt@b\let\next}’/, 

\else\let\next=\s@tcols\f i\next} 

\def \s@tt@b{\let\next=\relax \us@false\m@ketabbox} 

\outer\def \+{\tabalign} \def\tabalign{\us@true \m@ketabbox} 

\def \s@tcols#l\columns{\ count <§=#1 \dimen@=\hsize 
\loop \ifnum\count@>0 \@nother \repeat} 

\def \@nother{\dimen@ii=\dimen@ \divide\dimen@ii by\count@ 
\setbox\tabs=\hbox{\hbox to\dimen@ii{}\unhbox\tabs}'/, 

\advance\dimen@ by-\dimen@ii \advance\count@ by -1 } 

\def \m@ketabbox{\begingroup 

\global\setbox\tabsyet=\copy\tabs \global\setbox\tabsdone=\null 
\def \cr{\@crtrue\crcr\egroup\egroup 
\ifus@ \unvboxO \lastbox\fi \endgroup 
\setbox\tabs=\hbox{\unhbox\tabsyet\unhbox\tabsdone}}‘/, 
\setboxO=\vbox\bgroup\@crf alse \ialign\bgroup&\t@bbox##\t@bb@x\crcr} 

\def \t@bbox{\setboxO=\hbox\bgroup} 

\def \t@bb@x{\if@cr\egroup */, now \boxO holds the column 

\else\hss\egroup \global\setbox\tabsyet=\hbox{\unhbox\tabsyet 
\global\setboxl=\lastbox}'/, now \boxl holds its size 
\ifvoidl \global\setboxl=\hbox to\wdO{}*/, 

\else\setboxO=\hbox to\wdl{\unhboxO}\f i 
\global\setbox\tabsdone=\hbox{\boxl\unhbox\tabsdone}\f i 
\boxO} 
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The macro \+ has been declared ‘\outer’ here, so that TfgX will be better able to 
detect runaway arguments and definitions (see Chapter 20). A non-\outer version, 
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called \tabalign, has also been provided in case it is necessary to use \+ in some 
“inner” place. You can use \tabalign just like \+, except after \settabs. 

• Paragraph shapes of a limited but important kind are provided by \item, 
\itemitem, and \narrower. There are also two macros that haven’t been mentioned 
before: (1) \hang causes hanging indentation by the normal amount of \parindent, 
after the first line; thus, the entire paragraph will be indented by the same amount 
(unless it began with \noindent). (2) \textindent{stuff} is like \indent, but it puts 
the ‘stuff’ into the indentation, flush right except for an en space; it also removes spaces 
that might follow the right brace in ‘{stuff}’. For example, the present paragraph 
was typeset by the commands ‘\textindent{$\bullet$} Paragraph shapes . . the 
opening ‘P’ occurs at the normal position for a paragraph’s first letter. 

\def\hang{\hangindent\par indent} 

\def\item{\par\hang\text indent} 

\def \itemitem{\par\indent \hangindent2\parindent \textindent} 

\def\t ext indent# l{\indent\llap{#l\enspace}\ignore spaces} 

\def \narrower{\advance\leftskip by\par indent 
\advance\rightskip by\parindent} 

The \beginsection macro is intended to mark the beginning of a new major 
subdivision in a document; to use it, you say ‘\beginsection(section title)’ followed by 
a blank line (or \par). The macro first emits glue and penalties, designed to start a new 
page if the present page is nearly full; then it makes a \bigskip and puts the section 
title flush left on a line by itself, in boldface type. The section title is also displayed on 
the terminal. After a \smallskip, with page break prohibited, a \noindent command 
is given; this suppresses indentation in the next paragraph, i.e. , in the first paragraph 
of the new section. (However, the next “paragraph” will be empty if vertical mode 
material immediately follows the \beginsection command.) 

\outer\def \beginsection#l\par{\vskipOpt plus . 3\vsize\penalty-250 
\vskipOpt plus- . 3\vsize\bigskip\vskip\parskip 
\message{#l}\leftline{\bf#l}\nobreak\smallskip\no indent} 

Special statements in a mathematical paper are often called theorems, lemmas, 
definitions, axioms, postulates, remarks, corollaries, algorithms, facts, conjectures, or 
some such things, and they generally are given special typographic treatment. The 
\proclaim macro, which was illustrated earlier in this appendix and also in Chapter 20, 
puts the title of the proclamation in boldface, then sets the rest of the paragraph in 
slanted type. The paragraph is followed by something similar to \medbreak, except 
that the amount of penalty is different so that page breaks are discouraged: 

\outer\def \proclaim #1. #2\par{\medbreak 
\noindent{\bf #1 . \enspace}{\sl#2\par}'/, 

\ifdim\lastskip<\medskipamount \removelastskip\penalty55\medskip\f i} 

Ragged-right setting is initiated by restricting the spaces between words to 
have a fixed width, and by putting variable space at the right of each line. You should 
not call \raggedright until your text font has already been specified; it is assumed that 
the ragged-right material will not be in a variety of different sizes. (If this assumption 
is not valid, a different approach should be used: \f ontdimen parameters 3 and 4 of the 
fonts you will be using should be set to zero, by saying, e.g., ‘\f ontdimen3\tenrm=0pt’. 
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These parameters specify the stretchability and shrinkability of interword spaces.) A 
special macro \ttraggedright should be used for ragged-right setting in typewriter 
type, since the spaces between words are generally bigger in that style. (Spaces are 
already unstretchable and unshrinkable in font cmtt.) 

\def \raggedright{\rightskip=Opt plus2em 

\spaceskip=. 3333em \xspaceskip= . 5em\relax} 

\def \ttraggedright{\tt\rightskip=Opt plus2em\relax} 

Now we come to special symbols and accents, which depend primarily on the 
characters available in the Computer Modern fonts. Different constructions will be 
necessary if other styles of type are used. When a symbol is built up by forming a box, 
the \leavevmode macro is called first; this starts a new paragraph, if TgX is in vertical 
mode, but does nothing if TgX is in horizontal mode or math mode. 

\chardefY/,= ‘ V/. \chardef\&=‘\& \chardef\#=‘\# \chardef\$=‘\$ 

\chardef \ss="19 

\chardef\ae="lA \chardef\oe="lB \chardef\o="lC 
\chardef \AE= "ID \chardef \0E="1E \chardef\0="lF 
\chardef \i=" 10 \chardef\j="ll '/, dotless letters 
\def \aa{\accent ’ 27a} \def \l{\char ’401} 

\def \leavevmode{\unhbox\voidb@x} '/, begins a paragraph, if necessary 
\def \_{\leavevmode \kern.06em \vbox{\hrule width0.3em}} 

\def \L{\leavevmode\setboxO=\hbox{L}\hbox to\wdO{\hss\char ’40L}} 

\def \AA{\leavevmode\setboxO=\hbox{h}\dimen@=\htO \advance\dimen<9 by- lex 
\rlap{\raise . 67\dimen@\hbox{\char ’ 27}}A} 

\def \mathhexbox#l#2#3{\leavevmode 
\hbox{$\m@th \mathchar"#l#2#3$}} 

\def \dag{\mathhexbox279} \def \ddag{\mathhexbox27A} 

\def \S{\mathhexbox278} \def \P{\mathhexbox27B} 

\def \oalign#l{\leavevmode\vtop{\baselineskipOpt \lineskip . 25ex 
\ialign{##\crcr#l\crcr}}} '/, put characters over each other 
\def \ooalign{\lineskiplimit-\maxdimen \oalign} 

\def \d#l{\oalign{#l\crcr\hidewidth. \hidewidth}} 

\def \b#l{\oalign{#l\crcr\hidewidth 

\vbox to . 2ex{\hbox{\char ’26}\vss}\hidewidth}} 

\def \c#l{\setboxO=\hbox{#l}\ifdim\htO=lex \accent’30 #1'/, 

\else{\ooalign{\hidewidth\char ’ 30\hidewidth\crcr\unhbox0}}\f i} 

\def \copyright{{\ooalign 

{\hf il\raise . 07ex\hbox{c}\hf il\crcr\mathhexbox20D}}} 

\def \dots{\relax\ifmmode\ldots\else$\m@th \ldots\ , $\f i} 

\def \TeX{T\kern- . 1667em \lower . 5ex\hbox{E}\kern- . 125em X} 

\def \ ‘ #l{{\accent" 12 #1}} \def \ ’ #l{{\accent" 13 #1}} 

\def \v#l{{\accent" 14 #1}} \def \u#l{{\accent" 15 #1}} 

\def \=#l{{\accent" 16 #1}} \def \~#l{{\accent"5E #1}} 

\def\.#l{{\accent"5F #1}} \def \H#l{{\accent"7D #1}} 

\def \~#l{{\accent"7E #1}} \def \"#l{{\accent"7F #1}} 

\def \t#l{{\edef\next{\the\font}\the\textf ontl\accent"7F\next#l}} 
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At this point three alternative control-symbol accents are defined, suitable for 
keyboards with extended character sets (cf. Appendix C): 

\let\**_=\v \let\~"S=\u \let\“D=\~ 

Various ways to fill space with leaders are provided next. 

\def \hrulef ill{\leaders\hrule\hf ill} 

\def \dotf ill{\cleaders\hbox{$\m@th \mkernl.5mu . \mkernl . 5mu$}\hf ill} 

\def \rightarrowf ill{$\m@th \mathord- \mkern-6mu 

\cleaders\hbox{$\mkern-2mu \mathord- \mkern-2mu$}\hf ill 
\mkern-6mu \mathord\rightarrow$} 

\def \leftarrowf ill{$\m@th \mathord\leftarrow \mkern-6mu 
\cleaders\hbox{$\mkern-2mu \mathord- \mkern-2mu$}\hf ill 
\mkern-6mu \mathord-$} 

\mathchardef \braceld="37A \mathchardef \bracerd="37B 
\mathchardef \bracelu="37C \mathchardef \braceru="37D 
\def \upbracef ill{$\m@th 

\bracelu\leaders\vrule\hf ill\bracerd 
\braceld\leaders\vrule\hf ill\braceru$} 

\def \downbracef ill{$\m@th 

\braceld\leaders\vrule\hf ill\braceru 
\bracelu\leaders\vrule\hf ill\bracerd$} 

The \upbracefill and \downbracef ill macros have restricted usage: they must ap- 
pear a 11 by themselves in an hbox or an alignment entry, except for horizontal spacing. 

Finally, the fifth section of plain.tex closes by defining \bye: 

\outer\def \bye{\par\vf ill\supereject\end} '/, the recommended way to stop 
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6. Macros for math. The sixth section of plain.tex is the longest: but it will suffice to 
give only excerpts here, because most of it is simply a tedious listing of special symbols 
together with their font locations, and the same information appears in Appendix F. 

Some rudimentary things come first: The control sequences \sp and \sb are 
provided for people who can’t easily type * and there are four control symbols that 
provide spacing corrections: a “discretionary times sign” \* is defined; and then there’s 
an interesting set of macros that convert f ’ ’ ’ into f "{\prime\prime\prime}: 

\let\sp=“ \let\sb=_ {\catcode ‘\_=\active \global\let_=\_} 

\def \ , {\mskip\thinmuskip} \def \ ! {\mskip-\thinmuskip} 

\def \>{\mskip\medmuskip} \def \ ; {\mskip\thickmuskip} 

\def \*{\discretionary{\thinspace\the\textf ont2\char2}{}{}} 

{\catcode ‘ \~~Z=\active \gdef ~~Z{\not=}} '/, ~~Z is like \ne in math 

{\catcode ‘ \ ’ =\active \gdef ’ {“\bgroup\prim@s}} 

\def \prim@s{\prime\futurelet\next\pr@m@s} 

\def \pr@m@s{\ifx ’\next\let\nxt\pr@@@s \else\ifx*\next\let\nxt\pr@@@t 
\else\let\nxt\egroup\f i\f i \nxt} 

\def \pr@@@s#l{\prim@s} \def \pr@@@t#l#2{#2\egroup} 
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The next job is to define Greek letters and other symbols of type Ord. Up- 
percase Greek letters are assigned hexadecimal codes of the form "7xxx, so that they 
will change families when \fam changes. Three dots 1 • ■ ■ ’ are used here and below to 
indicate that additional symbols, having similar definitions, are listed in Appendix F. 

\mathchardef \alpha="010B • • • \mathchardef \omega="0121 

\mathchardef \Gairana="7000 • • • \mathchardef \0mega="700A 

\mathchardef \aleph="0240 • • • \mathchardef \spadesuit="027F 

\def \hbar{{\mathchar ’26\mkern-9inuh}} 

\def \surd{{\mathchar"1270}} 

\def \angle{{\vbox{\ialign{$\m@th\scriptstyle##$\crcr 

\not\mathrel{\mkernl4mu}\crcr \noalign{\nointerlineskip} 

\mkern2 . 5mu\leaders\hrule height . 34pt\hf ill\mkern2 . 5mu\crcr}}}} 

Large operators are assigned hexadecimal codes of the form "lxxx: 

\mathchardef \smallint="1273 

\mathchardef \sum=" 1350 ••• \mathchardef\biguplus="1355 

\mathchardef \intop=" 1352 \def \int{\intop\nolimits} 

\mathchardef \ointop="1348 \def \oint{\ointop\nolimits} 

Integral signs get special treatment so that their limits won’t be set above and below. 
Binary operations are next; nothing exciting here. 

\mathchardef \pm="2206 ••• \mathchardef\amalg="2271 

Relations are also fairly straightforward, except for the ones that are con- 
structed from other characters. The \mapstochar is a character V of width zero that 
is quite useless by itself, but it combines with right arrows to make \mapsto ‘i-T and 
\longmapsto 5 — Similarly, \not is a relation character of width zero that puts 
a slash over the character that follows. When two relations are adjacent in a math 
formula, TpX puts no space between them. 

\mathchardef \leq="3214 • • • \mathchardef \perp="323F 

\def \joinrel{\mathrel{\mkern-3mu}} 

\def\relbar{\mathrel{\ smash-}} \def \Relbar{\mathrel=} 
\def\longrightarrow{\relbar\joinrel\right arrow} 
\def\Longrightarrow{\Relbar\joinrel\Right arrow} 

\def \longleftarrow{\leftarrow\joinrel\relbar} 

\def \Longleftarrow{\Leftarrow\joinrel\Relbar} 

\def \longleftrightarrow{\leftarrow\joinrel\rightarrow} 

\def \Longleftrightarrow{\Leftarrow\joinrel\Rightarrow} 

\mathchardef \mapstochar="322F \def\mapsto{\mapstochar\right arrow} 

\def \longmapsto{\mapstochar\longrightarrow} 

\mathchardef \lhook="312C \def\hookrightarrow{\lhook\ jo inrel\right arrow} 
\mathchardef \rhook="312D \def\hookleftarrow{\leftarrow\ joinrel\rhook} 

\def \neq{\not=} \def \models{\mathrel I \joinrel=} 

\def \bowtie{\mathrel\triangleright\joinrel\mathrel\triangleleft} 

After defining characters \ldotp and \cdotp that act as math punctuation, 
it is easy to define \ldots and \cdots macros that give the proper spacing in most 
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circumstances. Vertical and diagonal dots (\vdots and \ddots) are also provided here: 

\mathchardef \ldotp="613A\mathchardef \cdotp="6201\mathchardef \colon="603A 
\def \ldots{\mathinner{\ldotp\ldotp\ldotp}} 

\def \cdots{\mathinner{\cdotp\cdotp\cdotp}} 

\def \vdots{\vbox{\baselineskip=4pt \lineskiplimit=Opt 
\kern6pt \hbox{ . }\hbox{ . }\hbox{ . }}} 

\def \ddots{\mathinner{\mkernlmu\raise7pt\vbox{\kern7pt\hbox{ . }}\mkern2mu 
\raise4pt\hbox{ . }\mkern2mu\raiselpt\hbox{ . }\mkernlmu}} 

Most of the math accents are handled entirely by the \mathaccent primitive, 
but a few of the variable-width ones are constructed the hard way: 

\def \acute{\mathaccent"7013 } • • • \def \ddot{\mathaccent"707F } 

\def \widetilde{\mathaccent"0365 } \def \widehat{\mathaccent"0362 } 

\def \overright arrow# l{\vbox{\ialign{##\crcr 

\rightarrowf ill\crcr\noalign{\kern-lpt\no inter line skip} 

$\hf il\displaystyle{#l}\hf il$\crcr}}} 

\def \overleftarrow#l{\vbox{\ialign{##\crcr 

\leftarrowf ill\crcr\noalign{\kern-lpt\nointerlineskip} 

$\hf il\displaystyle{#l}\hf il$\crcr}}} 

\def \overbrace#l{\mathop{\vbox{\ialign{##\crcr\noalign{\kern3pt} 
\downbracef ill\cr cr\noal ign{\kern3pt\no inter lineskip} 

$\hf il\displaystyle{#l}\hf il$\crcr}}}\limits} 

\def \underbrace#l{\mathop{\vtop{\ialign{##\crcr 

$\hfil\displaystyle{#l}\hfil$\crcr\noalign{\kern3pt\no inter lineskip} 
\upbracef ill\crcr\noalign{\kern3pt}}}}\limits} 

\def \skew#l#2#3{{#2{#3\mkern#lmu}\mkern-#lmu}{}} 


Now we come to 24 delimiters that can change their size: 


\def \langle{\delimiter"426830A } 

\def \lbrace{\delimiter "4266308 } 

\def \lceil{\delimiter"4264306 } 
\def\lfloor{\delimiter "4262304 } 

\def \lgroup{\delimiter"400033A } 
\def\lmoustache{\delimiter "4000340 } 
\def \uparrow{\delimiter" 3222378 } 
\def\downarrow{\delimiter" 3223379 } 
\def \updownarrow{\delimiter"326C33F } 
\def \Updownarrow{\delimiter"326D377 } 
\def \vert{\delimiter"026A30C } 

\def \backslash{\delimiter"026E30F } 


\def\rangle{\delimiter"526930B } 
\def\rbrace{\delimiter "5267309 } 
\def\rceil{\delimiter"5265307 } 
\def\rf loor{\delimiter "5263305 } 
\def\rgroup{\delimiter"500033B } 
\def\rmoustache{\delimiter "5000341 } 
\def\Uparrow{\delimiter"322A37E } 
\def\Downarrow{\delimiter"322B37F } 
\def\arrowvert{\delimiter"033C000 } 
\def \Arrowvert{\delimiter"033D000 } 
\def\Vert{\delimiter"026B30D } 
\def\bracevert{\delimiter"033E000 } 


The ‘\big. . . \Bigg’ macros produce specific sizes: 

\def \bigl{\mathopen\big} \def \bigm{\mathrel\big} \def \bigr{\mathclose\big} 
\def \Bigl{\mathopen\Big} \def \Bigm{\mathrel\Big} \def \Bigr{\mathclose\Big} 
\def \biggl{\mathopen\bigg} \def\Biggl{\mathopen\Bigg} 

\def \biggm{\mathrel\bigg} \def\Biggm{\mathrel\Bigg} 

\def \biggr{\mathclose\bigg} \def\Biggr{\mathclose\Bigg} 
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\def \big#l{{\hbox{$\left#l\vbox to 8 . 5pt{}\right . \n@space$}}} 

\def \Big#l{{\hbox{$\left#l\vbox to 11 . 5pt{}\right . \n@space$}}} 

\def \bigg#l{{\hbox{$\left#l\vbox to 14 . 5pt{}\right . \n@space$}}} 

\def \Bigg#l{{\hbox{$\left#l\vbox to 17 . 5pt{}\right . \n@space$}}} 

\def \n@space{\nulldelimiterspace=Opt \m@th} 

There are a few other simple abbreviations related to delimiters: 

\def \choose{\atopwithdelims () } 

\def \brack{\atopwithdelims [] } 

\def \brace{\atopwithdelims\{\}} 

\def \sqrt{\radical"270370 } 

And now we come to something more interesting. The \mathpalette opera- 
tion constructs a formula in all four styles: it is applied here in the implementation of 
\phantom, \smash, \root, and other operations. 

\def\mathpalette#l#2{\mathchoice{#l\di splay sty le{#2}} 

{#l\textstyle{#2}}{#l\scriptstyle{#2}}{#l\scriptscriptstyle{#2}}} 

\newbox\rootbox 

\def \root#l\of {\setbox\rootbox= 

\hbox{$\m@th \scriptscriptstyle{#l}$} 

\mathpalette\r@@t} 

\def \r@@t#l#2{\setbox0=\hbox{$\m@th #l\sqrt{#2}$} 

\dimen@=\ht0 \advance\dimen@ by-\dp0 

\mkern5mu \raise . 6\dimen@\copy\rootbox \mkern-10mu \boxO} 

\newif\ifv@ \newif\ifh@ 

\def \vphantom{\v@true\h@false\ph@nt} 

\def \hphantom{\v@f alse\h@true\ph@nt} 

\def \phantom{\v@true\h@true\ph@nt} 

\def \ph@nt{\iframode\def\next{\mathpalette\mathph@nt}'/, 
\else\let\next=\makeph@nt\f i \next} 

\def \makeph@nt#l{\setboxO=\hbox{#l}\f inphQnt} 

\def \mathph@nt#l#2{\setbox0=\hbox{$\m@th#l{#2}$}\f inphOnt} 

\def \f inph@nt{\setbox2=\null \ifv@ \ht2=\ht0 \dp2=\dp0 \fi 
\ifh@ \wd2=\wd0 \fi \box2 } 

\def \mathstrut{\vphantom(} 

\def \smash{\relax */, \relax, in case this comes first in \halign 
\ifmmode\def \next{\mathpalette\mathsm@sh}\else\let\next\make sm@sh 
\fi \next} 

\def \makesm@sh#l{\setboxO=\hbox{#l}\f insmSsh} 

\def \mathsm@sh#l#2{\setbox0=\hbox{$\m@th#l{#2}$}\f insmOsh} 

\def \f insm@sh{\ht0=0pt \dp0=0pt \boxO } 

\def \cong{\mathrel{\mathpalette\@vereq\sim}} */, \sim over = 

\def \@vereq#l#2{\lower . 5pt\vbox{\baselineskip0pt \lineskip- . 5pt 
\ialign{$\m@th#l\hf il##\hf il$\crcr#2\crcr=\crcr}}} 

\def \notin{\mathrel{\mathpalette\c@ncel\in}} 

\def \c@ncel#l#2{\ooalign{$\hf il#l\mkernlmu/\hf il$\crcr$#l#2$}} 

\def \rightleftharpoons{\mathrel{\mathpalette\rlh<3{}}} 
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\def \rlh@#l{\vcenter{\hbox{\ooalign{\raise2pt 

\hbox{$#l\rightharpoonup$}\crcr $#l\leftharpoondown$}}}} 

\def \buildrel#l\over#2{\mathrel{\mathop{\kern0pt #2}\limits"{#l}}} 

\def \doteq{\buildrel\textstyle . \over=} 

These definitions illustrate how other built-up symbol combinations could be defined 
to work in all four styles. 

Alternate names are defined now: 

\let\ne=\neq \let\le=\leq \let\ge=\geq 

\let\{=\lbrace \let\ I =\Vert \let\}=\rbrace 

\let\to=\rightarrow \let\gets=\leftarrow \let\owns=\ni 
\let\land=\wedge \let\lor=\vee \let\lnot=\neg 

\def \iff {\ ; \Longleftrightarrow\ ; } 

The 32 common functions whose names generally appear in roman letters are 
listed in Chapter 18. Only a few of the definitions need to be shown here: 

\def \arccos{\mathop{\rm arccos}\nolimits} 

\def\tanh{\mathop{\rm tanh}\nolimits} 

\def \det{\mathop{\rm det}} • • • \def\sup{\mathop{\rm sup}} 

\def \liminf{\mathop{\rm lim\,inf}} \def\limsup{\mathop{\rm lim\,sup}} 

\def \bmod{\mskip-\medmuskip \mkern5mu 

\mathbin{\rm mod} \penalty900 \mkern5mu \mskip-\medmuskip} 

\def \pmod#l{\allowbreak \mkernl8mu ({\rm mod}\,\,#l)} 

The definition of \matrix goes to some pains to ensure that two n-rowed 
matrices will have the same height and the same depth, unless at least one of their 
rows is unusually big. The definition of \bordermatrix is even more complicated, but 
it seems to work reasonably well; it uses a constant \p@renwd that represents the width 
of a big extensible left parenthesis. 

\def \matrix#l{\null\ , \vcenter{\normalbaselines\m@th 
\ialign{\hf il$##$\hf il&&\quad\hf il$##$\hf il\crcr 
\mathstrut\crcr\noalign{\kern-\baselineskip} 
#l\crcr\mathstrut\crcr\noalign{\kern-\baselineskip}}}\ ,} 

\newdimen\p@renwd \setboxO=\hbox{\tenex B} \p@renwd=\wdO 
\def \bordermatr ix# l{\begingr oup \m@th 

\setbox0=\vbox{\def\cr{\crcr\noalign{\kern2pt\global\let\cr=\endline}} 
\ialign{$##$\hf il\kern2pt\kern\p@renwd&\thinspace\hf il$##$\hfil 
&&\quad\hf il$##$\hf il\crcr 

\omit\strut\hf il\crcr\noalign{\kern-\baselineskip} 
#l\crcr\omit\strut\cr}} 

\setbox2=\vbox{\unvcopy0 \global\setboxl=\lastbox} 
\setbox2=\hbox{\unhboxl \unskip \global\setboxl=\lastbox} 
\setbox2=\hbox{$\kern\wdl\kern-\p@renwd \left( \kern-\wdl 
\global\setboxl=\vbox{\boxl\kern2pt} 

\vcenter{\kern-\htl \unvboxO \kern-\baselineskip} \,\right)$} 

\null\ ; \vbox{\kern\htl\box2}\endgroup} 
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The next macros are much simpler: 

\def \cases#l{\left\{\ , \vcenter{\normalbaselines\m@th 

\ialign{$##\hf il$&\quad##\hf il\crcr#l\crcr}}\right . } 

\def \pmatrix#l{\left ( \matrix{#l} \right)} 

Finally there are macros for displayed equations: 

\def\openup{\af ter assignment \@penup\dimen@=} 

\def \@penup{\advance\lineskip\dimen@ 

\advance\baselineskip\dimen<3 \advance\lineskiplimit\dimen@} 

\def \eqalign#l{\null\ , \vcenter{\openupl\jot \m@th 

\ialign{\strut\hf il$\displaystyle{##}$&$\displaystyle{{}##}$\hf il 
\crcr#l\crcr}}\ ,} 

\newif \ifdt@p 

\def \displ@y{\global\dt@ptrue \openupl\jot \m@th 
\everycr{\noalign{\ifdt@p \global\dt@pfalse 

\vskip-\lineskiplimit \vskip\normallineskiplimit 
\else \penalty\interdisplaylinepenalty \fi}}} 

\def \<§lign{\tabskip=Opt\everycr={}} '/, restore inside \displ@y 
\def \display lines# l{\displ@y 

\halign{\hbox to\displaywidth{$\hf il\@lign\di splay sty le##\hfil$}\crcr 
#l\crcr}} 

\def \eqalignno#l{\displ@y \tabskip=\ centering 

\halign to\displaywidth{\hf il$\@lign\di splay sty le{##}$\tabskip=Opt 
&$\@lign\displaystyle{{}##}$\hfil\tabskip=\ centering 
&\llap{$\@lign##$}\tabskip=Opt\crcr 
#l\crcr}} 

\def \leqalignno#l{\displ@y \tabskip=\centering 

\halign to\displaywidth{\hf il$\@lign\di splay sty le{##}$\tabskip=Opt 
&$\@lign\displaystyle{{}##}$\hf il\tabskip=\ centering 
&\kern-\displaywidth\rlap{$\@lign##$}\tabskip=\displaywidth\crcr 
#l\crcr}} 

The value of \lineskiplimit is assumed to be \normallineskiplimit plus the ac- 
cumulated amount of “opening up.” Thus, the \vskip instructions in \displ@y will 
compensate for the fact that the first baseline of an alignment is separated by an 
opened-up baselineskip from the last line preceding the display. 

7. Macros for output. The plain.tex file also contains the output routine described in 
Chapters 15 and 23. First there are simple facilities related to page numbers, headings, 
and footings: 

\countdef\pageno=0 \pageno=l '/, first page is number 1 
\newtoks\headline \headline={\hf il} '/, headline is normally blank 
\newtoks\f ootline \f ootline={\hss\tenrm\folio\hss} 

'/, footline is normally a centered page number in font \tenrm 
\def \f olio{\ifnum\pageno<0 \romannumeral-\pageno \else\number\pageno \fi} 
\def \nopagenumbers{\footline={\hf il}} '/, blank out the footline 
\def \advancepageno{\ifnum\pageno<0 \global\advance\pageno by -1 
\else\global\advance\pageno by 1 \fi} '/, increase I pageno I 
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\newif \ifr@ggedbottom 

\def \raggedbottom{\topskiplOpt plus60pt \r@ggedbottomtrue} 

\def \normalbottom{\topskiplOpt \r@ggedbottomf alse} '/, undoes \raggedbottom 

The \footnote macro lias a few subtle features that can best be appreciated 
by someone who reads Chapter 15 very carefully. It also uses some \bgroup and 
\futurelet and \aftergroup trickery, so that the footnote text does not need to be a 
parameter to \vfootnote: 

\newinsert\f ootins 

\def \f ootnote#l{\let\@sf =\empty '/, parameter #2 (the text) is read later 
\ifhmode\edef\@sf{\spacefactor=\the\spacef actor}\/\f i 
#l\@sf \vfootnote{#l}} 

\def\vfootnote#l{\ insert \footins\bgroup 

\interlinepenalty=\interfootnotelinepenalty 

\splittopskip=\ht\strutbox '/, top baseline for broken footnotes 
\splitmaxdepth=\dp\strutbox \floatingpenalty=20000 
\leftskip=Opt \rightskip=Opt \spaceskip=Opt \xspaceskip=Opt 
\textindent{#l}\footstrut\futurelet\next\f o@t} 

\def \f o@t{\if cat\bgroup\noexpand\next \let\next\f @@t 
\else\let\next\f @t\f i \next} 

\def \f @@t{\bgroup\aftergroup\@f oot\let\next} 

\def \f @t#l{#l\@f oot} 

\def \@foot{\strut\egroup} 

\def \f oot strut {\vbox to\splittopskip{}} 

\skip\footins=\bigskipamount '/, space added when footnote is present 
\count\f ootins=1000 '/, footnote magnification factor (1 to 1) 

\dimen\f ootins=8in */, maximum footnotes per page 

Floating insertions are handled by doing an \insert whose vertical list consists 
of a penalty item followed by a single box: 

\newinsert\topins \newif \ifp@ge \newif\if <§mid 
\def \topinsert{\@midfalse\p@gef alse\@ins} 

\def \midinsert{\@midtrue\@ins} 

\def \pageinsert{\@midf alse\p@getrue\@ins} 

\skip\topins=Opt '/, no space added when a topinsert is present 
\count\topins=1000 '/, magnification factor (1 to 1) 

\dimen\topins=\maxdimen '/, no limit per page 

\def \<§ins{\par\begingroup\setboxO=\vbox\bgroup} '/, start a \vbox 
\def \endinsert{\egroup '/, finish the \vbox 

\if@mid \dimen@=\htO \advance\dimen<§ by\dp\z<§ \advance\dimen@ byl2\p@ 
\advance\dimen@ by\pagetotal \advance\dimen@ by-\page shrink 
\ifdim\dimen<§>\pagegoal \@midf alse\p@gef alse\f i\f i 
\if@mid \bigskip \boxO \bigbreak 

\else\insert\topins{\penaltylOO '/, floating insertion 

\splittopskip=Opt \splitmaxdepth=\maxdimen \f loatingpenalty=0 
\ifp@ge \dimen@=\dpO 

\vbox to\vsize{\unvboxO \kern-\dimen@} '/, depth is zero 
\else \boxO \nobreak\bigskip\f i}\f i\endgroup} 
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Most of the \output routine appears in Chapter 23; it is given here in full: 
\output={\plainoutput} 

\def\plainoutput{\shipout\vbox{\makeheadline\pagebody\makef ootline}'/, 
\advancepageno 

\ifnum\outputpenalty>-20000 \else\dosupereject\f i} 

\def \pagebody{\vbox to\vsize{\boxmaxdepth=\maxdepth \pagecontents}} 

\def \makeheadline{\vbox to 0pt{\vskip-22 . 5pt 

\line{\vbox to8. 5pt{}\the\headline}\vss}\nointerlineskip} 

\def \makef ootline{\baselineskip=24pt \line{\the\f ootline}} 
\def\dosupereject{\ifnum\insertpenalties>0 '/, something is being held over 
\line{}\kern-\topskip\nobreak\vf ill\supere ject\f i} 

\def \pagecontents{\ifvoid\topins\else\unvbox\topins\f i 
\dimen@=\dp255 \unvbox255 

\ifvoid\footins\else '/, footnote info is present 

\vskip\skip\f ootins \footnoterule \unvbox\f ootins\f i 
\ifr@ggedbottom \kern-\dimen@ \vfil \fi} 

\def \f ootnoterule{\kern-3pt 

\hrule width 2truein \kern 2.6pt} '/, the \hrule is .4pt high 
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8. Hyphenation and everything else. The last part of plain.tex reads the hyphenation 
patterns and exceptions found on file hyphen.tex (see Appendix H); then it defines a 
few miscellaneous macros, sets up \rm type, and that’s all! 

\lefthyphenmin=2 \righthyphenmin=3 disallow x- or -xx breaks 
\input hyphen */, the hyphenation patterns and exceptions 

\def \magnif ication{\afterassignment\m@g\count@} 

\def \m@g{\mag=\count@ 

\hsize6. 5truein\vsize8 . 9truein\dimen\footins8truein} 

\def \tracingall{\tracingonline=l \tracingcommands=2 \tracingstats=2 
\tracingpages=l \tracingoutput=l \tracinglostchars=l 
\tracingmacros=2 \tracingparagraphs=l \tracingrestores=l 
\showboxbreadth=\maxdimen \showboxdepth=\maxdimen \err or stopmode} 

\def \showhyphens#l{\setboxO=\vbox{\parf illskipOpt \hsize=\maxdimen \tenrm 
\pretolerance=-l \tolerance=-l \hbadness=0 \showboxdepth=0 \ #1}} 
\normalbaselines\rm '/, select roman font 
\nonfrenchspacing */, punctuation affects the spacing 
\catcode ‘@=12 '/, at signs are no longer letters 

\def \fmtname{plain}\def\fmtversion{3 . 0} */, identifies the current format 

The format name and version number are recorded in control sequences, in order to 
help the people who might have to explain why something doesn’t work. Macro files 
like plain.tex should not be changed in any way, except with respect to preloaded 
fonts, unless the changes are authorized by the author of the macros. 
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PERLIS 


The purpose of a programming system is to make a computer easy to use. 

To do this, it furnishes languages and various facilities 
that are in fact programs invoked and controlled by language features. 

But these facilities are bought at a price: 
the external description of a programming system is ten to twenty times 
as large as the external description of the computer system itself. 
The user finds it far easier to specify any particular function, 
but there are far more to choose from, 
and far more options and formats to remember. 
— FREDERICK P. BROOKS, JR., The Mythical Man Month (1975) 

When someone says, "I want a programming language 
in which I need only say what I wish done," 
give him a lollipop. 
— ALAN PERLIS, Epigrams on Programming (1982) 
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Different, computers tend to have different ways of representing the characters in files of codes 

text, but TjgX gives the same results on all machines, because it converts everything to alphabetic constants 

a standard internal code when it reads a file. T^X also converts back from its internal ASCII 

representation to the appropriate external code, when it writes a file of text; therefore 

most users need not be aware of the fact that the codes have actually switched back 

and forth inside the machine. 

The purpose of this appendix is to define TgX’s internal code, which has the 
same characteristics on all implementations of TJgX. The existence of such a code is 
important, because it makes T^rX constructions “portable.” For example, T^X allows 
alphabetic constants like ‘b to be used as numbers; the fact that ‘b always denotes 
the integer 98 means that we can write machine-independent macros that decide, for 
instance, whether a given character is a digit between 0 and 9. Furthermore the internal 
code of TgX also survives in its dvi output files, which can be printed by software that 
knows nothing about where the dvi data originated; essentially the same output will 
be obtained from all implementations of T(?X, regardless of the host computer, because 
the dvi data is expressed in a machine-independent code. 

TjgX’s internal code is based on the American Standard Code for Information 
Interchange, known popularly as “ASCII.” There are 128 codes, numbered 0 to 127; 
we conventionally express the numbers in octal notation, from '000 to ' 1 77, or in 
hexadecimal notation, from "00 to "7F. Thus, the value of ‘b is normally called '142 or 
"62, not 98. In the ASCII scheme, codes '000 through '040 and code '177 are assigned 
to special functions; for example, code '007 is called BEL, and it means “Ring the bell.” 

The other 94 codes are assigned to visible symbols. Here is a chart that shows ASCII 
codes in such a way that octal and hexadecimal equivalents can easily be read off: 
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Ever since ASCII was established in the early 1960s, people have had dif- uparrow uparrow 
ferent ideas about what to do with positions ' 000 -'037 and ' 17 7, because most of 
the functions assigned to those codes are appropriate only for special purposes like file 
transmission, not for applications to printing or to interactive computing. It turned out 
that manufacturers soon started producing line printers that were capable of generating 
128 characters, 33 of which were tailored to the special needs of particular customers; 
part of the advantage of a standard code was therefore lost. On the other hand, the 
remaining 95 codes (including ' 40 = SP, a blank space) have become widely adopted, 
and they are now implanted within most of today’s computer terminals. When an 
ASCII keyboard is available, you can specify each of the 128 codes to Tp;X in terms of 
the 95 standard characters, as follows: 



' 0 

'1 

'2 

'3 

'4 

'5 

'6 
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'00x 
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"8 

"9 

"A 

"B 

"C 

"D 

"E 

"F 



(Here doesn’t necessarily mean two circumflex characters; it means two identical 
characters whose current \catcode is 7. In such cases IjjX simply adds or subtracts 
'100 from the internal code of the character that immediately follows. For example, 
* can also be typed as “j; j can also be typed as “*.) 

An extended ASCII code intended for text editing and interactive computing 
was developed at several universities about 1965, and for many years there have been 
terminals in use at Stanford, MIT, Carnegie-Mellon, and elsewhere that have 120 or 121 
symbols, not just 95. Aficionados of these keyboards (like the author of this book) are 
loath to give up their extra characters; it seems that such people make heavy use of 
about 5 of the extra 25, and occasional use of the other 20, although different people 
have different groups of five. For example, the author developed T^X on a keyboard 
that includes the symbols 4, #, <, and >, and he finds that this makes it much more 
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pleasant to type class notes, technical papers, and computer programs of the kind he 
likes to write; his logician friends make heavy use of the V and 3 keys; and so on. It 
is recommended that TjjX implementations on systems with large character sets be 
consistent with the following codes: 



'0 

'1 
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Of course, designers of TjrjX macro packages that are intended to be widely used should 
stick to the standard ASCII characters. 

Incidentally, the ASCII character * that appears in position '136 is sometimes 
called a “caret,” but dictionaries of English tell us that a caret is a larger symbol, more 
like character '004 in the extended set above. The correct name for * is “circumflex,” 
but this is quite a mouthful, so a shorter name like “hat” is preferable. It seems 
desirable to preserve the traditional distinction between caret and hat. 

The extended code shown above was developed at MIT; it is similar to, but 
slightly better than, the code implemented at Stanford. Seven of the codes are con- 
ventionally assigned to the Standard ASCII control functions NUL ((null)), HT ((tab)), 
LF ((linefeed)), FF ((formfeed)), CE ((return)), ESC ((escape)), and DEL ((delete)), and 
they appear in the standard ASCII positions; hence the corresponding seven charac- 
ters • 7 8 ± © O / do not actually appear on the keyboard. These seven “hidden” 
characters show up only on certain output devices. 

Modern keyboards allow 256 codes to be input, not just 128; so Tj?X represents 
characters internally as numbers in the range 0-255 (i.e., '000-'3 77, or "00-"FF). 
Implementations of I/ 5 X differ in which characters they will accept in input files and 
which they will transmit to output files; these subsets can be specified independently. A 
completely permissive version of TpX allows full 256-character input and output; other 
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versions might ignore all but. the visible characters of ASCII; still other versions might 
distinguish the tab character (code 'Oil ) from a space on input, but might output each 
tab as a sequence of three characters “I. 

Many people, unfortunately, have the opposite problem: Instead of the 95 stan- 
dard characters and some others, they have fewer than 95 symbols actually available. 
What can be done in such cases? Well, it’s possible to use TgX with fewer symbols, 
by invoking more control sequences; for example, plain TjgX defines \lq, \rq, \lbrack, 
\rbrack, \sp, and \sb, so that you need not type ‘, ’, [,],*, and _, respectively. 

A person who implements TjjX on computer systems that do not have 95 ex- 
ternally representable symbols should adhere to the following guidelines: (a) Stay as 
close as possible to the ASCII conventions. (b) Make sure that codes '041-'046 , 
' 060-'071 , '141-'146, and '160-'171 are present and that each unrepresentable in- 
ternal code < '200 leads to a representable code when '100 is added or subtracted; 
then all 256 codes can be input and output, (c) Cooperate with everyone else who 
shares the same constraints, so that you all adopt the same policy. (See Appendix J 
for information about the T^?X Users Group.) 

Very few conventions about character codes are hardwired into T^X: Al- 
most everything can be changed by a format package that changes parameters like 
\escapechar and sets up the \catcode, \mathcode, \uccode, \lccode, \sfcode, and 
\delcode tables. Thus a T)?X manuscript that has been written in Denmark, say, can 
be run in California, and vice versa, even though quite different conventions might be 
used in different countries. The only character codes that TpX actually “knows” are 
these: (1) INITEX initializes the code tables as described in Appendix B; the same 
initialization is done by all implementations of TjgX. (2) TjgX uses the character codes 
u+-. , ‘ ’ "<=>0123456789ABCDEF in its syntax rules (Chapters 20, 24, and Appendix H), 
and it uses most of the uppercase and lowercase letters in its keywords pt, to, plus, 
etc. These same codes and keywords are used in all implementations of T^jX. For 
example, when T)gX is implemented for Cyrillic keyboards, the letter ‘n’ should be as- 
signed to code '160 and ‘t’ to code '164, so tliat. ‘m’ still means ‘pt’; or else control 
sequences should be defined so that what TpX sees is equivalent to the keywords it 
needs. (3) The operations \number, \romannumeral, \the, and \meaning can generate 
letters, digits, spaces, decimal points, minus signs, double quotes, colons, and “>’ signs; 
these same codes are generated in all implementations of T^X. (4) The \hyphenation 
and \pattern commands described in Appendix H give special interpretation to the 
ten digits and to the characters and (5) The codes for the four characters 

$ . { } are inserted when TpX recovers from certain errors, and braces are inserted 
around an \output routine; appropriate catcodes are attached to these tokens, so it 
doesn’t matter if these symbols have their plain T)?X meanings or not. (6) There is 
a special convention for representing characters 0-255 in the hexadecimal forms ~~00- 
“ff , explained in Chapter 8. This convention is always acceptable as input, when “ 
is any character of catc.ode 7. Text output is produced with this convention only when 
representing characters of code > 128 that a TJrX installer has chosen not to output 
directly. 
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ASCII 

REID 


Code sets obtained by modifying the standard as shown above 
or by other replacements are nonstandard. 
— ASA SUBCOMMITTEE X3.2, American Standard 
Code for Information Interchange (1963) 

Both the Stanford and DEC uses of the ASCII control characters 
are in violation of the USA Standard Code, 
but no Federal Marshal is likely to come running out 
and arrest people who type control-T to their computers. 
— BRIAN REID, SCRIBE Introductory User’s Manual (1978) 
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TgX was designed to do the ordinary tasks of typesetting: to make paragraphs 
and pages. But the underlying mechanisms that facilitate ordinary typesetting — 
e.g.. boxes, glue, penalties, and macros — are extremely versatile; hence people 
have discovered sneaky ways to coerce TgX into doing tricks quite different from 
what its author originally had in mind. Such clever constructions are not gen- 
erally regarded as examples of “high T^X” ; but many of them have turned out 
to be useful and instructive, worthy of being known (at least by a few wizards). 
The purpose of this appendix is to introduce crafty and/or courageous readers 
to the nether world of Tp)X arc: aria. 

(&> (&> (&> P^ ease don’t read this material until you’ve 

YYYYYYYYYYY had plenty of experience with plain TfrX. 

After you have read and understood the secrets below, you’ll know all sorts of devious 
combinations of TJ?X commands, and you will often be tempted to write inscrutable 
macros. Always remember, however, that there’s usually a simpler and better way to 
do something than the first way that pops into your head. You may not have to resort 
to any subterfuge at all, since TfrX is able to do lots of things in a straightforward way. 
Try for simple solutions first. 

1. Macro madness. If you need to write complicated macros, you’ll need to be familiar 
with the fine points in Chapter 20. Tf^X’s control sequences are divided into two main 
categories, “expandable” and “unexpandable” ; the former category includes all macros 
and \if . . .\fi tests, as well as special operations like \the and \input, while the lat- 
ter category includes the primitive commands listed in Chapter 24. The expansion of 
expandable tokens takes place in IfrX’s “mouth,” but primitive commands (including 
assignments) are done in T^X’s “stomach.” One important consequence of this struc- 
ture is that it is impossible to redefine a control sequence or to advance a register while 
IfjX is expanding the token list of, say, a \message or \write command; assignment 
operations are done only when TyX is building a vertical or horizontal or math list. 

For example, it’s possible to put \n asterisks into a paragraph, by saying 
simply ‘{\loop\ifnum\n>0 *\advance\n-l \repeat}’. But it’s much more difficult to 
define a control sequence \asts to consist of exactly \n consecutive asterisks. If \n 
were known to be at most 5, say, it would be possible to write 

\edef \asts{\if case\n\or*\or**\or***\or****\or*****\else\bad\f i} 

since TgX handles \ifcase in its mouth. But for general \n it would be impossible 
to use a construction like ‘\edef\asts{\loop\ifnum\n>0 *\advance\n-l \repeat}’, 
since \n doesn’t change during an \edef. A more elaborate program is needed; e.g., 

{\xdef\asts{} 

\loop\ifnum\n>0 \xdef \asts{\asts*}\advance\n-l \repeat} 

And here’s another solution (which is faster, because token list registers can be ex- 
panded more quickly than macros, using \the): 

\newcount\m \newtoks\t \m=\n \t={} 

\loop \ifnum\m>0 \t=\expandafter{\the\t *} \advance\m-l \repeat 
\edef \asts{\the\t} 
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However, both of these solutions have a running time proportional to the 
square of \n. There’s a much quicker way to do the job: 

\begingroup\aftergroup\edef \aftergroup\asts\aftergroup{ 

\loop \ifnum\n>0 \aftergroup*\advance\n-l \repeat 

\aftergroup}\endgroup 

Get it? The \aftergroup commands cause a whole list of other tokens to be saved up 
for after the group! This method has only one flaw, namely that it takes up \n cells of 
space on TjrX’s input stack and \n more on IjrX’s save stack; lienee a special version 
of TfrX may be required when \n is larger than 150 or so. 

(Incidentally, there’s a completely different way to put \n asterisks into a 
paragraph, namely to say ‘\setboxO=\hbox{*}\cleaders\copyO\hskip\n\wdO’. This 
may seem to be the fastest solution of all; but actually it is not so fast, when all things 
are considered, since it generates four bytes of dvi output per asterisk, compared to 
only one byte per asterisk in the other methods. Input/output time takes longer than 
computation time, both in TjrX itself and in the later stages of the printing process.) 

The problem just solved may seem like a rather special application; after all, 
who needs a control sequence that contains a variable number of asterisks? But the 
same principles apply in other similar cases, e.g., when you want to construct a variable- 
length \parshape specification. Similarly, many of the “toy problems” solved below 
are meant to illustrate paradigms that can be used in real-life situations. 

The precise rules for expansion are explained in Chapter 20; and the best way 
to get familiar with Tj^X’s expansion mechanism is to watch it in action, looking at the 
log file when \tracingmacros=2 and \tracingcommands=2. One of the important ways 
to change the normal order of expansion is to use \expandafter; the construction 

\expandafter\a\b 

causes \b to be expanded first, then \a. And since \expandafter is itself expandable, 
the construction 

\expandafter\expandafter\expandafter\a\expandafter\b\c 
causes \c to be expanded first, then \b, then \a. (The next step, 

\expandafter\expandafter\expandafter\expandafter 

\expandafter\expandafter\expandafter\a 

\expandafter\expandafter\expandafter\b\expandafter\c\d 

is probably too lengthy to be of any use.) 

It’s possible to make good use of \expandafter\a\b even when \a isn’t ex- 
pandable. For example, the token list assignment ‘\t=\expandafter{\the\t *}’ in the 
example on the previous page was able to invade territory where expansion is normally 
suppressed, by expanding after a left brace. Similarly, 

\t=\expandafter{\expandafter*\the\t} 
would have worked; and 

\uppercase\expandafter{\romannumeral\n} 
yields the value of register \n in uppercase roman numerals. 
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Here’s a more interesting example: Recall that \fontdimenl is the amount of 
“slant per point” of a font; hence, for example, ‘\the\f ontdimenl\tenit’ expands to 
‘0.25pt’, where the characters ‘pt’ are of category 12. After the macro definitions 

{\catcode ‘p=12 \catcode‘t=12 \gdef\\#lpt{#l}} 

\let\getf actor=\\ 

\def\kslant#l{\kern\expandafter\getfactor\the\f ontdimenl#l\htO} 

one can write, e.g., ‘\kslant\tenit’ and this will expand to ‘YkernO . 25\ht0’. If the 
boundary of \boxO is considered to be slanted by 0.25 horizontal units per vertical unit, 
this kern measures the horizontal distance by which the top edge of the box is skewed 
with respect to an edge at the baseline. All of the computation of \kslant is done in 
T^rX’s mouth; thus, the mouth can do some rather complicated things even though it 
cannot assign new values. (Incidentally, an indirect method was used here to define 
the control sequence \getfactor when the character t had category 12, since control 
words normally consist only of letters. The alternative construction 

{\catcode ‘p=12 \catcode‘t=12 

\ c s name expandafter\endcs name \gde f 
\csname getfactor\endcsname#lpt{#l}} 

would also have worked, since \csname and \endcsname don’t contain ‘p’ or ‘t ’ !) 

The mechanism by which T)gX determines the arguments of a macro can be 
applied in unexpected ways. Suppose, for example, that \t is a token list register that 
contains some text; we wish to determine if at least one asterisk (* 12 ) appears in that 
text. Here’s one way to do it: 

\newif\ifresult '/, for the result of a computed test 

\def\atest#l{\expandafter\a\the#l*\atest\a} 

\long\def \a#l*#2#3\a{\ifx\atest#2\resultfalse\else\resulttrue\f i} 

Now after ‘\atest\t’, the control sequence \ifresult will be \iftrue or \iffalse, 
depending on whether or not \t contains an asterisk. (Do you see why?) And here’s 
a slightly more elegant way to do the same thing, using \futurelet to look ahead: 

\def\btest#l{\expandafter\b\the#l*\bb} 

\long\def \b#l*{\futurelet\next\bb} 

\long\def \bb#l\bb{\ifx\bb\next\resultf alse\else\resulttrue\f i} 

In both cases the solution works if \t contains control sequence tokens as well as 
character tokens, provided that the special control sequences \atest, \a, and \bb 
don’t appear. Notice, however, that an asterisk is “hidden” if it appears within a 
group the test is limited to asterisks at nesting level zero. A token list register 

is always balanced with respect to grouping, so there is no danger of the test leading 
to error messages concerning missing braces or extra braces. 

We can apply the ideas in the preceding paragraph to solve a problem related 
to generalized math formatting: The goal is to set TgK up so that the respective 
constructions ‘$$cv$$’, ‘ $$ a \eqno /? $$ ’, and 1 $$ a \leqno /3 $$ ’ will cause a macro 
$$\generaldisplay$$ to be invoked, with \eq defined to be a; furthermore, the test 
\ifeqno should be true when an equation number f3 is present, and \ifleqno should 
be true in the case of \leqno. When (3 is present, it should be stored in \eqn. Here 


fontdimen 

slant 

the 

getfactor 

csname 

endcsname 

arguments 

long 

newif 

futurelet 

group 

displays, non-centered 

dollar dollar 

eqno 

leqno 

communication between macros 



376 Appendix D: Dirty Tricks 


o and f3 are arbitrary balanced token lists that don’t contain either \eqno or \leqno 
at nesting level zero. The following macros do the required maneuvers: 

\newif\if eqno \newif \ifleqno \everydisplay{\displaysetup} 

\def \di splay setup#l$${\displaytest#l\eqno\eqno\displaytest} 
\def\displaytest#l\eqno#2\eqno#3\di splay test{'/, 

\if ! #3 ! \ldisplaytest#l\leqno\leqno\ldisplaytest 
\else\eqnotrue\leqnof alse\def\eqn{#2}\def \eq{#l}\f i 
\generaldisplay$$} 

\def\ldisplaytest#l\leqno#2\leqno#3\ldisplaytest{\def \eq{#l}'/, 

\if ! #3 ! \eqnof alse\else\eqnotrue\leqnotrue\def \eqn{#2}\f i} 

An examination of the three cases $$ a $$, $$ cAeqno 8 $$, $$ o:\leqno § $$ shows that 
the correct actions will ensue. Parameter #3 in the tests ‘\if ! #3 ! ’ will be either empty 
or \eqno or \leqno; thus, the condition will be false (and the second ‘ ! ’ will be skipped) 
unless #3 is empty. 

Returning to the problem of *’s in \t, suppose that it’s necessary to consider 
*’s at all levels of nesting. Then a slower routine must be used: 

\def\ctest#l{\resultf alse\expandafter\c\the#l\ctest} 
\def\c{\afterassignment\cc\let\next= } 

\def\cc{\ifx\next\ctest \let\next\relax 

\else\ifx\next*\resulttrue\f i\let\next\c\f i \next} 

Here \afterassignment has been used to retain control after a non-future Met; the 
‘= ’ ensures that exactly one token is swallowed per use of \c. This routine could 
be modified in an obvious way to count the total number of *’s and/or tokens in \t. 
Notice the ‘\let\next’ instructions in \cc; it should be clear why the alternative 

\def\cc{\ifx\next\ctest\else\ifx\next*\resulttrue\f i\c\f i} 

would not work. (The latter \c would always swallow a ‘\fi’.) 

Space tokens are sometimes anomalous, so they deserve special care. The 
following macro \futurenonspacelet behaves essentially like \futurelet except that 
it discards any implicit or explicit space tokens that intervene before a nonspace is 
scanned: 

\def\futurenonspacelet#l{\def \cs{#l}'/, 

\afterassignment\stepone\let\nexttoken= } 

\def\\{\let\stoken= } \\ '/, now \stoken is a space token 
\def\stepone{\expandafter\futurelet\cs\steptwo} 
\def\steptwo{\expandafter\ifx\cs\stoken\let\next=\stepthree 
\else\let\next=\nexttoken\f i \next} 
\def\stepthree{\afterassignment\stepone\let\next= } 

An operation like \futurenonspacelet is useful, for example, when implementing 
macros that have a variable number of arguments. 

Notice that ‘\def\stepthree#l{\stepone}’ would not work here, because 
of l^X’s rule that a u io token is bypassed if it would otherwise be treated as an 
undelimited argument. Because of this rule it is difficult to distinguish explicit space 
tokens from implicit ones. The situation is surprisingly complex, because it’s possible 
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to use \uppercase to create “funny space” tokens like * 10 ; for example, the commands 
\uccode‘ ='* \uppercase{\uppercase{\def\f spacef }\let\ftoken= } } 

make \fspace a macro that expands to a funny space, and they make \ftoken an 
implicit funny space. (The tests \if\fspace*, \if \ftoken*, \if cat\f space\stoken, 
and \ifcat\ftoken\stoken will all be true, assuming that * has category 12; but if * 
has category 10, \if\fspace* will be false, because T£X normalizes all newly created 
space tokens to u io, as explained in Chapter 8.) Since the various forms of space 
tokens are almost identical in behavior, there’s no point in dwelling on the details, f 

The argument to \ write is expanded when a \shipout occurs, but sometimes 
expansion isn’t desired. Here’s a macro (suggested by Todd Allen) that suppresses all 
expansion, by inserting \noexpand before each control sequence or active character. 
The macro assumes that " is an active character, and that the tokens being written do 
not include implicit spaces or braces. Funny spaces are changed to ordinary ones. 

\long\def \unexpandedwrite#l#2{\def \f inwrit e{\write#l}'/, 

{\aftergroup\f inwrite\aftergroup{\sanitize#2\endsanity}}} 
\def\sanitize{\futurelet\next\sanswitch} 
\def\sanswitch{\ifx\next\endsanity 

\else\if cat\noexpand\next\stoken\aftergroup\space\let\next=\eat 
\else\if cat\noexpand\next\bgroup\aftergroup{\let\next=\eat 
\else\if cat\noexpand\next\egroup\aftergroup}\let\next=\eat 
\else\let\next=\copytoken\f i\f i\f i\f i \next} 
\def\eat{\afterassignment\sanitize \let\next= } 

\long\def \copytoken#l{\if cat\noexpand#l\relax\aftergroup\noexpand 
\else\if cat\noexpand#l\noexpand~\aftergroup\noexpand\f i\f i 
\aftergroup#l\sanitize} 

\def\endsanity\endsanity{} 

As before, the heavy use of \aftergroup in \unexpandedwrite means that parameter 
#2 should not include more than about 150 tokens. 
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f The following little program is for r I [.;X exegetes who insist on learning the whole 
story: Macro \stest decides whether or not the first token of a given token list 
register is a (space token) as defined in Chapter 24. If so, the macro decides whether 
or not the token is “funny,” i.e., whether or not the character code is different from an 
ASCII (space); the macro also decides whether a space token is explicit or implicit. 

\nevif\if space \newif \iff unny \newif\if explicit 
\def \stest#lf\expandafter\s\the#l ! \stest} 

\def \sf\f unnyf alse \global\explicitf alse \f uturelet\next\ss} 

\def \ssf\if cat\noexpand\next\stoken \spacetrue 

\if x\next\stoken \let\next=\sss \else\let\next=\ssss \fi 
\else \let\next=\sssss \fi \next} 

\long\def \sss#l #2\stestf\def \next{#l}% 

\if x\next\empty \global\explicittrue \fi} 

\long\def \ssss#l#2\stestf\f unny true {\uccode ‘#1=‘ " 

\uppercasef\if cat\noexpand#l}\noexpand“°/ 0 active funny space 
\else \escapechar=\if *#1 ‘?\else‘*\fi 

\if#l\string#l\global\explicittrue\f i \f i}} 

\long\def \sssss#l\stest{\spacef alse} 
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2. List macros. The next several macros we shall discuss can be used to maintain lists 
of information in the form 

\\{(itemi)}\\{(item 2 )} ... \\{(item„)} 

where each (item) is a balanced list of tokens. A parameterless control sequence whose 
replacement text has this form may be called a list macro. The empty list macro has 
n = 0 and it is called \empty. 

It’s easy to add new items at either end of a list macro, and to concatenate 
list macros, for example as follows: 

\toksdef\ta=0 \toksdef \tb=2 '/, token list registers for temp use 
\long\def \leftappenditem#l\to#2{\ta={\\{#l}}\tb=\expandafter{#2}'/, 
\edef #2{\the\ta\the\tb}} 

\long\def \rightappenditem#l\to#2{\ta={\\{#l}}\tb=\expandafter{#2}*/, 
\edef #2{\the\tb\the\ta}} 

\def\concatenate#l=#2&#3{\ta=\expandafter{#2}\tb=\expandafter{#3}'/, 
\edef #l{\the\ta\the\tb}} 

Conversely, the left item of a list can be removed and placed in a control sequence by 
the Mop macro defined in the following curious way: 

\def\lop#l\to#2{\expandafter\lopoff #l\lopoff#l#2} 

\long\def \lopoff\\#l#2\lopoff #3#4{\def #4{#l}\def #3{#2}} 

For example, if \1 expands to the list ‘\\{a\b}\\{c}\\{{d}}’, the macro invocation 
\lop\l\to\z makes \1 expand to ‘\\{c}\\{{d}}’ and \z expand to ‘a\b’. The Mop 
operation should be used only when \1 is nonempty, otherwise an error will occur; to 
test if \1 is empty, one simply says ‘\ifx\l\empty’. 

The programming details of the Mop macro indicate why individual items have 
been enclosed in groups. A simpler kind of list, in which grouping is omitted 

and an extra \\ appears at the end, suffices for many purposes; one could define, for 
instance, 

\long\def \lopoff\\#l\\#2\lopoff#3#4{\def #4{#l}\def #3{\\#2}} 

and the results would be almost the same as before. In this case an empty list macro 
expands to ‘W. However, the new Mop resulting from this new Mopoff macro also 
removes a pair of braces, if the leftmost item happens to be a group; extra braces are 
included in our general scheme to prevent such anomalies. 

So far the examples we’ve considered haven’t revealed why the M’s appear in 
the general scheme; it appears that grouping by itself should be enough. But in fact, 
the \\ separators are enormously useful, because we can define \\ to be any desired 
one-argument macro, and then we can execute the list! For example, here’s a way to 
count the number of items: 

\def\cardinality#l\to#2{#2=0 \long\def\\##l{\advance#2 byl }#1} 

(Parameter #2 is supposed to be the name of a count register.) And here’s a way to 
take a list macro and center all its items on individual lines within a \vbox: 

\def\centerlist#l{\def\\##l{\relax##l\cr}'/, 

\vbox{\halign{\hf il##\hf il\cr#l}}} 
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A particular item can be selected by its position number from the left: 

\def\select#l\of#2\to#3{\def#3{\out of range}'/, 

\long\def \\##l{\advance#l-l \ifnum#l=0 \def#3{##l}\f i}#2} 

(Here #1 is a count register, #2 is a list macro, and #3 is a control sequence.) And so 
on; hundreds of other applications can be imagined, f 

TfrX does all of the preceding operations efficiently, in the sense that the 
running time will be proportional to the length of the list macro involved. It’s natural 
to ask if the rightmost item can be removed with equal efficiency, since the final item 
of a list is somewhat hard to isolate. There is apparently no way to delete the nth 
item of an n-itern list in order n steps, maintaining complete generality, unless the 
\aftergroup trick (by which we created a macro that expands to n asterisks) is used; 
and the \aftergroup trick is somewhat unattractive in the list application, because 
the list might be quite long.J However, if we restrict list items to unexpandable tokens, 
it turns out to be possible to remove the rightmost item quite efficiently: 

\def\deleterightmost#l{\edef#l{\expandafter\xyzzy#l\xyzzy}} 
\long\def \xyzzy\\#l#2{\ifx#2\xyzzy\yzzyx 
\else\noexpand\\{#l}\f i\xyzzy#2} 

\long\def \yzzyx#l\xyzzy\xyzzy{\f i} 

Careful study of this example shows that TgX’s mouth is capable of doing recursive 
operations, given sufficiently tricky macros. 

The contents of a \ count register can easily be converted to decimal and stored 
in a control sequence; for example, if \n is a register, ‘\edef\csn{\the\n}’ puts its value 
into \csn. Conversely, a value from \csn can be put back into \n by saying simply 
‘\n=\csn’. There’s usually no point in doing this transformation just to minimize the 
usage of \ count registers, since TpX has 256 of them; but a decimal representation 
like the expansion of \csn can be stored in a list macro, and that might be useful in 
some applications. Incidentally, there’s a neat way to test if such a control-sequence- 
number is zero: ‘\ifO\csn(true text)\else(false text)\fi’ works because extra digits 
of a nonzero number will be ignored with the (true text). 

A technique something like list macros can be used to maintain unordered sets 
of control sequences. In this case it’s convenient to leave off the braces; for example, 

\def\l{\\\alpha\\\beta\\\gamma} 

defines a “set macro” \1 that represents the control sequences { \alpha, \beta, \garama }. 
A straightforward construction tests whether a given control sequence is in the set: 

\def\ismember#l\of#2{\resultf alse\def \given{#l}*/, 

\def\\##l{\def\next{##l}\ifx\next\given\resulttrue\f i}#2} 

And an efficient but not-so-straightforward construction removes all occurrences of 
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f The concept of a list macro is strongly related to the concept of a list procedure in 
a programming language; see Communications of the ACM 7 (1964), 280. 

t The interested reader may enjoy constructing a macro that removes the kth item of an 
n-item list macro \1 in 0(n log n) steps, given k and \1, without using \aftergroup. 
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control sequences that are Mfx-equivalent to a given control sequence: 

\def\remequivalent#l\from#2{\let\given=#l'/, 

\ifx#2\empty\else\edef#2{\expandafter\plugh#2\plugh}\f i} 
\def\plugh\\#l#2{\ifx#l\given\else\noexpand\\\noexpand#l\f i 
\ifx#2\plugh\hgulp\f i\plugh#2} 

\def\hgulp\f i\plugh\plugh{\f i} 

3. Verbatim listing. Plain TjjX includes a macro called \dospecials that is essentially 
a set macro, representing the set of all characters that have a special category code. 
(The control sequence \do plays the role of \\ in the discussion above.) Therefore it’s 
easy to change all of the special characters to category 12 (other): 

\def \uncatcodespecials{\def \do##l{\catcode ‘ ##1=12 }\dospecials} 

This works even when the set of special characters lias been changed, provided that 
\dospecials has been updated to represent the current set. 

The operation \uncatcodespecials just defined is important, of course, when 
TjvX’s automatic features need to be temporarily disabled. Let’s suppose that we want 
to create a listing of some computer file, reproducing the characters and the spacing 
exactly as they appear in the file. To make the problem more interesting, let’s also print 
line numbers in front of each line, as in the listing of story.tex on page 24. To make 
the problem simpler, let’s assume that the file contains only standard ASCII printing 
characters: no tab marks or form feeds or such things. Our goal is to devise a Misting 
macro such that, e.g., ‘Misting! story}’ will insert a listing of the story.tex file into 
a manuscript, after which Tj^X’s normal conventions will be restored. The listing should 
be in \tt type. A macro of the following form meets the desired specifications: 

\def\listing#l{\par\begingroup\setupverbatim\input#l \endgroup} 

Notice that the \endgroup command here will nicely “turn off” all the weird things that 
\setupverbatim turns on. Notice also that the commands ‘\input#l \endgroup’ will 
not be listed verbatim, even though they follow \setupverbatim, since they entered 
TjjX’s reading mechanism when the Misting macro was expanded (i.e., before the 
verbatim business was actually set up). 

But what should \setupverbatim do? Well, it ought to include \obeylines, 
since this automatically inserts a \par at the end of each line that is input; it ought 
to include \uncatcodespecials, so that special characters print as themselves; and it 
ought to include \obeyspaces, so that each space counts. But we need to look carefully 
at each of these things to see exactly what they do: (1) Plain Tj?X’s \obeylines macro 
changes the \catcode of “M to \active, and then it says ‘\let“M=\par’. Since “M 
is placed at the end of each line, this effectively ends each line with \par; however, 
\obeylines doesn’t say ‘\def "~M{\par}’, so we must make any desired changes to 
\par before invoking \obeylines. (2) The \uncatcodespecials operation changes a 
space to category 12; but the \tt font has the character ‘ u ’ in the (space) position, so we 
don’t really want u i2- (3) The \obeyspaces macro in Appendix B merely changes the 
(space) character to category 13; active character u i 3 has been defined to be the same 
as \space, a macro that expands to u io- This is usually what is desired; for example, 
it means that spaces in constructions like ‘\hbox to 10 pt won’t cause any 

trouble. But in our application it has an undesirable effect, because it produces spaces 
that are affected by the space factor. To defeat this feature, it’s necessary either to 
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say \frenchspacing or to redefine u i3 to be the same as \ u . The latter alternative is 
better, because the former will discard spaces at the beginning of each line. 

The \setupverbatim macro should also take care of putting a line number 
into the position of the paragraph indentation. We can take care of this by introducing 
a counter variable and using \everypar, as follows: 

\newcount\lineno '/, the number of file lines listed 
\def\setupverbatim{\tt \lineno=0 

\obeylines \uncatcodespecials \obeyspaces 

\everypar{\advance\lineno byl \llap{\sevenrm\the\lineno\ \ }}} 
{\obeyspaces\global\let =\ } '/, let active space = control space 

In theory, this seems like it ought to work; but in practice, it fails in two ways. One 
rather obvious failure — at least, it becomes obvious when the macro is tested — is that 
all the empty lines of the file are omitted. The reason is that the \par command at the 
end of an empty line doesn’t start up a new paragraph, because it occurs in vertical 
mode. The other failure is not as obvious, because it occurs much less often: The \tt 
fonts contain ligatures for Spanish punctuation, so the sequences ? ‘ and ! ‘ will be 
printed as i and j respectively. Both of these defects can be cured by inserting 

\def\par{\leavevmode\endgraf} \catcode ‘\ <= \active 

before \obeylines in the \setupverbatim macro, and by defining ‘13 as follows: 

{\catcode ‘\ <= \active \gdef ‘{\relax\lq}} 

A similar scheme could be used to produce verbatim listings in other fonts; but more 
characters would have to be made active, in order to break ligatures and to compensate 
for ASCII characters that aren’t present. 

Instead of listing a file verbatim, you might want to define a \ verbatim macro 
such that ‘\verbatim{$this$ is {\it !}}’ yields ‘$this$ is {\it!}’. It’s somewhat 
dangerous to change category codes, because TpX stamps the category on each char- 
acter when that character is first, read from a file. Thus, if \ verbatim were defined by 
a construction of the form \long\def\verbatim#l{(something)}, argument #1 would 
already be converted to a list of tokens when (something) starts; \catcode changes 
would not affect the argument. The alternative is to change category codes before 
scanning the argument to \verbatim: 

\def\verbatim{\begingroup\tt\uncat code specials 
\obeyspaces\doverbatim} 

\newcount\balance 

{\catcode ‘<=1 \catcode‘>=2 \catcode ‘\{=12 \catcode ‘\}=12 
\gdef \doverbatim{<\balance=l\verbatimloop> 

\gdef \verbatimloop#l<\def \next<#l\verbatimloop>*/, 
\if#l{\advance\balance byl 
\else\if #l}\advance\balance by-1 
\ifnum\balance=0\let\next=\endgroup\f i\f i\f i\next>> 

This works; but it’s slow, and it allows verbatim setting only of text that has balanced 
braces. It would not be suitable for typesetting the examples in a book like The 
TpjXbook. (Appendix E contains the verbatim macros that were actually used.) Note 
also that if this \verbatim{. . .} macro appears in the argument to another macro like 
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\centerline, it will fail because the category codes can no longer be changed. The 
\footnote macro in Appendix B is careful to avoid scanning its argument prematurely; 
it uses \bgroup and \egroup in a somewhat tricky way, so that category code changes 
are permitted inside plain TgX footnotes. 

On the other hand, there is a fairly fast way to convert a token list to an 
almost-verbatim transcript: 

\long\def \verbatim#l{\def \next{#l}'/, 

{\tt\frenchspacing\expandafter\strip\meaning\next}} 

\def\strip#l>{} 

Tokens are stripped off in this construction since, for example, \meaning\next might 
be ‘macro :->$this$ is {\it !}’. Notice that a space will be inserted after the 
control word \it, but no space might actually have occurred there in the argument to 
\verbatim; such information has been irretrievably lost. 

One of the problems with verbatim mode is that it’s hard to stop; if we 
turn off all of TgX’s normal control capabilities, we end up “painting ourselves into 
a corner” and reaching a point of no return. The Misting macro was able to solve 
this problem because the end of a file brings an old token list back to life. Another 
solution would be to specify a certain number of lines, after which verbatim mode 
should end. Otherwise it’s necessary to put some constraint on the text, i.e., to make 
certain texts unprintable in verbatim mode. For example, here’s an approach that 
typesets everything between \beginverbatim and \endverbatim, assuming only that 
the control sequence \endverbatim does not need to be set: 

\def\beginverbatim{\par\begingroup\setupverbatim\doverbatim} 
{\catcode ‘\ | =0 \catcode ‘ \\=12 '/, I is temporary escape character 
I obeylines I gdef I doverbatim“M#l\endverbatim{#l I endgroup}} 

This construction assumes that \beginverbatim appears at the end of a line in the 
manuscript file. Argument #1 will be read entirely into T^X’s memory before anything 
happens, so the total amount of verbatim material had better not be too voluminous. 
Incidentally, it isn’t necessary to say that this macro is \long, because the \par’s 
inserted by \obeylines are really "~M’s. 

Another approach is to keep one character untouchable. For example, it’s 
possible to define things so that ‘\verbatim(char) (text)(char)’ will typeset the (text) 
verbatim, where the (text) is not supposed to contain any occurrences of the repeated 
delimiter (char): 

\def\verbatim{\begingroup\setupverbatim\doverbat im} 
\def\doverbatim#l{\def\next##l#l{##l\endgroup}\next} 

4- Selective loading of macros. Some interesting problems arise when a computer sys- 
tem acquires a large library of macro files. For example, suppose that a file macs.tex 
contains the lines 

\let\italcorr=\/ 

\def\/{\unskip\italcorr} 

because somebody thought it would be nice to allow an optional space before T^rX’s 
primitive \/ command. That’s fine, except if macs.tex is input twice; for example, two 
other macro files might both say \input macs. When those lines are processed the sec- 
ond time, \italcorr will be \let equal to a macro that expands to ‘\unskip\italcorr’, 
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and you can guess what will happen: will get into an infinite loop, stoppable only 

by interrupting the program manually. 

Fortunately there’s an easy way to prevent this problem, by placing a suitable 
interlock near the beginning of every macro file that might introduce such anomalies: 

\ifx\macsisloaded\relax\endinput\else\let\macsisloaded=\relax\f i 

Then Vnacsisloaded will be undefined at the time of the first \ifx, but the file will 
not be read twice. A different control sequence should, of course, be used for each file. 

Another difficulty with large sets of macros is that they take up space. It 
would be nice to preload every macro that every user has ever dreamed up; but 
there might not be enough room, because TjgX’s memory capacity is finite. You might 
find it necessary to hold back and to load only the macros that are really needed. 

How much memory space does a macro require? Well, there are four kinds 
of memory involved: token memory, name memory, string memory, and character 
memory. (If any of these becomes too full, it will be necessary to increase what T^X 
calls the macro memory size, the hash size, the number of strings, and/or the pool size, 
respectively; see Chapter 27.) The token memory is most important; a macro takes 
one cell of token memory for each token in its definition, including the and the *}’. 
For example, the comparatively short definition 

\def\example#l\two{\f our} 

takes five tokens: #1, [two], {i, [four], and >2- Each control sequence also takes up 
one cell of name memory, one cell of string memory, and as many cells of character 
memory as there are characters in the name (seven in the case of \example). Character 
memory is comparatively cheap; four characters, or in some cases five, will fit in the 
same number of bits as a single cell of token memory, inside the machine. Therefore 
you don’t save much by choosing short macro names. 

TgX will tell you how close you come to exceeding its current memory capacity 
if you say \tracingstats=l. For example, one of the runs that the author made while 
testing galley proofs of this appendix reported the following statistics: 

Here is how much of TeX’s memory you used: 

209 strings out of 1685 

1659 string characters out of 17636 

27618 words of memory out of 52821 

1172 multiletter control sequences out of 2500 

Consequently there was plenty of room for more macros: 52821 — 27618 = 25203 
unused cells of main memory, 2500 — 1172 = 1328 of name memory, 1685 — 209 = 1476 
of string memory, and 17636 — 1659 = 15977 of character memory. But a fairly large 
TgX was being used, and only the macros of Appendices B and E were loaded; in other 
circumstances it might have been necessary to conserve space. 

One obvious way to keep from loading too many macros is to keep the macro 
files short and to \ input only the ones that you need. But short files can be a nuisance; 
sometimes there’s a better way. For example, let’s suppose that a file contains five 
optional classes of macros called A, B, C, D, E, and that a typical user will probably 
want only at most two or three of these five; let’s design a \load macro so that, for 
example, l \load{macs}{AC}’ will load file macs.tex including options A and C but not 
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options B, D, or E. The following \load macro converts its second argument into a 
set macro called \options: 

\def\load#l#2{\let\options=\empty \addoptions#2\end \input#l } 
\def\addoptions#l{\ifx#l\end \let\next=\relax 
\else\let\\=\relax\edef \options{\options\\#l}'/, 
\let\next=\addoptions \fi \next} 

Inside the file macs.tex, a portion of code that should be loaded only under option B, 
say, can be enclosed by ‘\ifoption B . . . \fi’, where \ifoption is defined thus: 

\def\if option#l{\def\\##l{\if ##l#l\resulttrue\f i}*/, 

\resultfalse \options \ifresult} 

(This is a simple application of ideas presented earlier in this appendix.) 

However, the \ifoption. . .\fi scheme isn’t very robust, because it requires 
all of the macros in the optional part to be well nested with respect to \if . . . and \f i; 
a macro like \ifoption itself couldn’t easily be defined in such a place! There’s a better 
scheme that also runs faster, based on category code changes. This idea (due to Max 
Diaz) requires that the leftmost nonblank character on each line be either ‘V or ‘{’; 
it’s usually easy to arrange this. Furthermore, one other symbol, say ", is reserved. 
Then the text material that is to be loaded only under option B is preceded by the line 
‘\beginoption B’ and followed by a line that says ‘"endoptionalcode’. The \catcode 
for " is set to 14 (comment character), hence the "endoptionalcode line will have no 
effect if code is not being skipped. The \beginoption macro works like this: 

\def\beginoption#l{\ifoption#l\else\begingroup\swapcategories\f i} 
\def\swapcategories{\catcode ‘ \\=14 \catcode ‘\{=14 \catcode ‘\"=0 } 
\let\endoptionalcode=\endgroup 
\catcode ‘ \"=14 

Once the categories have been swapped, all lines will be skipped at high speed until the 
control sequence "endoptionalcode is encountered; then everything will be restored 
to its former state. Under this scheme, material that should be loaded only under 
both options B and D can be prefaced by both ‘\beginoption B’ and ‘\beginoption D’; 
material that should be loaded under either option B or option D (or both) can be 
prefaced by 

\beginoption B 
"oroption D 

if we define \oroption#l to be an abbreviation for ‘\ifoption#l\endgroup\f i’. 

Another kind of selective loading is sometimes appropriate, based on whether 
or not a particular control sequence is defined. In this scheme, if the control sequence 
is undefined, it should remain undefined and it should take up no space whatever in 
TpX's memory. There’s a slick way to do this, namely to say 

\ifx\cs\undef ined . . . \fi 

(assuming that \undefined has never been defined). T^rjX does not put undefined 
control sequences into its internal tables if they follow \ifx or if they are encountered 
while skipping conditional text. You can use this idea, for example, to prepare a 
bibliography for a paper, by reading a suitably arranged bibliography file; only the 
entries that correspond to defined control sequences will be loaded. 
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5. Brace hacks. Several of TJ^X’s operations depend on grouping, and you’ll want to 
know exactly what this means if you try to do certain tricky things. For example, plain 
TJrjX’s control sequences \bgroup and \egroup are “implicit braces” because they have 
been defined by 

\let\bgroup={ \let\egroup=} 

This means that you can include them in the replacement texts of definitions without 
worrying about how they nest; for example, the macros 

\def\beginbox{\setboxO=\hbox\bgroup} 

\def\endbox{\egroup\copyO } 

allow you to make a box between \beginbox and \endbox; the behavior is almost the 
same as 
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\def \beginbox#l\endbox{\setboxO=\hbox{#l}\copyO } 

but different in three important ways: (1) The first alternative allows category codes 
to change inside the box. (2) The first alternative is faster, because it doesn’t need 
to scan the box contents both as an argument and as a sequence of actual commands. 
(3) The first alternative takes less memory space, because no argument needs to be 
stored. Thus, the first alternative is usually superior. 

For the purposes of this discussion we shall assume that only ‘{’ has category 1 
and that only has category 2, although any characters can actually be used as group 
delimiters. Group nesting is crucial during two of Tj^X’s main activities: (a) when T|-X 
is scanning a (balanced text), e.g., when is forming the replacement text of a 
macro, a parameter, or a token list variable; (b) when T)gX must determine whether 
the token & or \span or \cr or \crcr is the end of an entry within an alignment. 

TpX’s mouth has two internal counting mechanisms to deal with nesting: The 
“master counter” goes up by 1 for each {i scanned by Tj?X, and down by 1 for each } 2 ; 
the “balance counter” is similar, but it is affected only by explicit {i and } 2 tokens 
that are actually contributed to a token list that is being formed. The master counter 
decreases by 1 when TJrX evaluates the alphabetic constant ‘{, and it increases by 1 
when TpX evaluates ‘ }, hence the net change is zero when such constants are evaluated. 
As a consequence of these rules, certain constructions produce the following effects: 
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The last two cases produce no begin-group tokens when expanded, but they do affect 
the master counter as shown. Thus, for example, 

\def \eegroup{\if numO= ‘ {\f i}} 

makes \eegroup behave rather like \egroup, but the expansion of \eegroup also de- 
creases the master counter. 

Alignment processing uses only the master counter, not the balance counter. 
An alignment entry ends with the first & or \span or \cr or \crcr that appears when 
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the master counter has the value that was present in the counter at the beginning of iffalse 

the entry. Thus, for example, the curious construction Box maneuvers 

poor man’s bold 

\halign{\show\par#\relax\cr 

\global\let\par=\cr 

{\global\let\par=\cr}\cr 

\par} 

causes TjjX to perform three \show instructions, in which the respective values of \par 
shown are \par, \relax, and \cr. Similarly, each template in the preamble to an 
alignment ends with the first & or \cr or \crcr that appears at the master counter 
level that was in effect at the beginning of the entry; hence ft and \cr and \crcr tokens 
can appear within a template of an alignment, if they are hidden by braces (e.g. , if 
they appear in a definition). 

These facts allow us to draw two somewhat surprising conclusions: (1) If an 
alignment entry has the form ‘ a \iff alse{\f i 8 \iffalse}\f i 7 ’, it’s possible for /3 
to include ft and \cr tokens that aren’t local to a group.* (2) The construction 

{\span\iffalse}\f i 

appearing in a preamble contributes ‘{’ to the template without any net change to the 
master counter; thus, it’s very much like \bgroup, except that it produces {1 explicitly. 

If you understand (1) and (2), you’ll agree that the present appendix deserves its name. 

6. Box maneuvers. Let’s turn now from syntax to semantics, i.e. , from TjrX’s mouth to 
its gastro-intestinal tract. Sometimes an odd symbol is needed in boldface type, but it’s 
available only in a normal weight. In such cases you can sometimes get by with “poor 
man’s bold,” obtained by overprinting the normal weight symbol with slight offsets. 

The following macro typesets its argument three times in three slightly different places, 
equidistant from each other; but the result takes up just as much space as if \pmb had 
been simply \hbox: 

\def\pmb#l{\setboxO=\hbox{#l}'/, 

\kern- . 025em\copy0\kern-\wd0 
\kern. 05em\copy0\kern-\wd0 
\kern- . 025em\raise . 0433em\box0 } 

For example, ‘\pmb{$\infty$}’ yields ‘ 00 ’. The results are somewhat fuzzy, and they 
certainly are no match for the real thing if it’s available; but poor man’s bold is better 
than nothing, and once in a while you can get away with it. 

When you put something into a box register, you don’t need to put the con- 
tents of that register into your document. Thus, you can write macros that do experi- 
ments behind the scenes, trying different possibilities before making a commitment to a 
particular decision. For example, suppose you are typesetting a text in two languages, 
and you would like to choose the column widths so that the same number of lines is 
obtained in both cases. For example, the following texts balance perfectly when the 


* The token list a should not be empty, however, because TgX expands the first token 
of an alignment entry before looking at the template, in order to see if the entry 
begins with \noalign or \omit. The master counter value that is considered to be 
present at the beginning of an entry is the value in the counter just after the “u part” 
of the template has been entirely read. 
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first column is 157.1875 pt wide and the second column is 166.8125 pt wide; but the 
second column would be one line longer than the first if they were both 162 pt wide: 


A. The creative part is really more in- 
teresting than the deductive part. In- 
stead of concentrating just on finding 
good answers to questions, it’s more 
important to learn how to find good 
questions! 

B. You’ve got something there. I wish our 
teachers would give us problems like, 
“Find something interesting about x," 
instead of “Prove x." 

A. Exactly. But teachers are so conser- 
vative, they’d be afraid of scaring off 
the “grind” type of students who obe- 
diently and mechanically do all the 
homework. Besides, they wouldn’t like 
the extra work of grading the answers 
to nondirected questions. 

The traditional way is to put off all 
creative aspects until the last part 
of graduate school. For seventeen or 
more years, a student is taught exams- 
manship, then suddenly after passing 
enough exams in graduate school he’s 
told to do something original. 


A. La parte creativa es rnucho mejor que la 
deductiva. En vez de concentrarse en bus- 
car buenas respuestas a ciertas cuestiones 
es mas important© aprender a proponerse 
buenas preguntas. 

B. Me parece una buena ocurrencia. Me 
gustarfa que los profesores propusieran 
problemas del estilo de ((Encuentren algo 
interesante sobre x)) en vez de ((Demuestre 
que x ...» . 

A. Exactamente. Pero los profesores son 
tan conservadores que temerfan espan- 
tar al tipo de estudiante «apisonadora» 
que hace lo que le proponen para casa, 
obedientemente y de forma mecanica. 
Ademas, no creo que les gustase el tra- 
bajo adicional de calificar respuestas a 
preguntas abiertas. 

La forma tradicional es dejar la parte 
creativa para los cursos altos. Durante 
diecisiete anos o mas se ensena al estu- 
diante a aprobar, luego de golpe, cerca de 
la graduation, se le pide que haga algo 
original. 


Some implementations of TjgX display the output as you are running, so that 
you can choose column widths interactively until a suitable balance is obtained. It’s 
fun to play with such systems, but it’s also possible to ask TgX to compute the column 
widths automatically. The following code tries up to ten times to find a solution in 
which the natural heights of the two columns are different by less than a given value, 
\delheight. The macros \firstcol and \secondcol are supposed to generate the 
columns, and the sum of column widths is supposed to be \doublewidth. 

\newdimen\doublewidth \newdimen\delheight \newif \iff ail \newcount\n 
\newdimen\trialwidth \newdimen\lowwidth \newdimen\highwidth 
\def \balancetwocols{\lowwidth=10em '/, lower bound on \trialwidth 
\highwidth=\doublewidth \advance\highwidth-10em */, upper bound 
{\n=l \hbadness=10000 \hfuzz=\maxdimen '/, disable warnings 

\loop \maketrial \testfailure \iffail \preparenewtrial \repeat} 
\maketrial} '/, now under/overfull boxes will be shown 
\def \maketrial{\trialwidth=. 5\lowwidth \advance\trialwidth by . 5\highwidth 
\setboxO=\vbox{\hsize=\trialwidth \f irstcol} 

\setbox2=\vbox{\hsize=\doublewidth\advance\hsize-\trialwidth\ seconded}} 
\def \testf ailure{\dimenO=\htO \advance\dimen0-\ht2 
\ifnum\dimen0<0 \dimenO=-\dimenO \fi 

\ifdim\dimenO>\delheight \ifnum\n=10 \f ailf alse\else\failtrue\f i 
\else\failfalse\f i} 

\def \preparenewtrial{\ifdim\ht0>\ht2 \global\lowwidth=\trialwidth 
\else\global\highwidth=\trialwidth\f i \advance\n byl } 


Alice 

Bill 
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Neither column will be less than 10 ems wide. This code does a “binary search,” 
assuming that a column will not increase in height when it is made wider. If no 
solution is found in 10 trials, there probably is no way to obtain the desired balance, 
because a tiny increase in the width of the taller column will make it shorter than 
the other one. The values of \hbadness and \hfuzz are made infinite during the trial 
settings, because warning messages that relate to unused boxes are irrelevant; after a 
solution is found, it is computed again, so that any relevant warnings will be issued. 

When a box has been put into a box register, you can change its height, width, 
or depth by assigning a new value to the \ht, \wd, or \dp. Such assignments don’t 
change anything inside the box; in particular, they don’t affect the setting of the glue. 

But changes to a box’s dimensions can be confusing if you don’t understand 
exactly how l^X deals with boxes in lists. The rules are stated in Chapter 12, but it 
may be helpful to restate them here in a different way. Given a box and the location of 
its reference point, T^X assigns locations to interior boxes as follows: (1) If the box is 
an libox, TjgX starts at the reference point and walks through the horizontal list inside. 
When the list contains a box, TgX puts the reference point of the enclosed box at the 
current position, and moves right by the width of that box. When the list contains 
glue or kerning, etc., TJgX moves right by the appropriate amount. (2) If the box is a 
vbox, TjgX starts at the upper left corner (i.e., TgX first moves up from the reference 
point, by the height of the box) and walks through the vertical list inside. When the 
list contains a box, TgX puts the upper left corner of that box at the current position; 
i.e., T^gX moves down by the height of that box, then puts the box’s reference point at 
the current position, then moves down by the depth of the box. When the list contains 
glue or kerning, etc., T)gX moves down by the appropriate amount. 

As a consequence of these rules, we can work out what happens when the 
dimensions of a box are changed. Let \delta be a (dimen) register, and let \h and \hh 
specify horizontal lists that don’t depend on \boxO. Consider the following macro: 

\newdimen\temp \newdimen\delta 

\def\twohboxes#l{\setboxl=\hbox{\h \copyO \hh} 

\temp=#10 \advance\temp by \delta #10=\temp 
\setbox2=\hbox{\h \copyO \hh}} 

For example, \twohboxes\wd makes two hboxes, \boxl and \box2, that are identical 
except that the width of \boxO has been increased by S in \box2. What difference 
does this make? There are several cases, depending on whether #1 is \wd, \ht, or \dp, 
and depending on whether \boxO is an libox or a vbox. Case 1, \twohboxes\wd: The 
material from \hh is moved right by 6 in \box2, compared to its position in \boxl. Also 
\wd2 is 5 more than \wdl. Case 2, \twohboxes\ht: If \boxO is an libox, everything 
remains in the same position; but if \boxO is a vbox, everything in \copyO moves up 
by 6. Also \ht2 may differ from \htl. Case 3, \twohboxes\dp: Everything remains 
in the same position, but \dp2 may differ from \dpl. 

Similarly, we can work out the changes when box dimensions are changed for 
boxes within vertical lists. In this case we shall ignore the influence of interline glue by 
defining \twovboxes as follows: 

\def\twovboxes#l{ 

\ setbox l=\vbox{\v\noint erlineskip\copy (Anoint erlineskip\vv} 
\temp=#10 \advance\temp by \delta #10=\temp 

\setbox2=\vbox{\v\nointerlineskip\copy0\nointerlineskip\vv}} 


binary search 

hbadness 

hfuzz 
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What is the difference between \boxl and \box2 now? Case 1, \twovboxes\wd: 
Everything remains in the same position, but \wd2 may differ from \wdl. Case 2, 
\twovboxes\ht: If \boxO is an hbox, everything in \v moves up by 3 in \box2, com- 
pared to the corresponding positions in \boxl, if we make the reference points of the 
two boxes identical; but if \boxO is a vbox, everything in it moves up by <5, together 
with the material in \v. Also, \ht2 is 6 more than \htl. Case 3, \twovboxes\dp: If 
\vv is empty, \dp2 is (5 more than \dpl, and nothing else changes. Otherwise everything 
in \v and in \copyO moves up by <f, and \ht2 is <5 more than \htl. 

Tp,X is designed to put boxes together either horizontally or vertically, not 
diagonally. But that’s not a serious limitation, because the use of negative spacing 
makes it possible to put things anywhere on a page. For 
example, the seven points in the diagram at the right of this 
paragraph were typeset by saying simply 

\hbox{\unit=\baseline skip 
\point 0 0 
\point 0 8 
\point 0 -8 
\point -1 -2.5 
\point 4 7 
\point 4 2 
\point 1 1.5 
} 

The \point macro makes a box of width zero; hence the 
individual \point specifications can be given in any order, 
and there’s no restriction on the coordinates: 

\newdimen\unit 

\def\point#l #2 {\rlap{\kern#l\unit 
\raise#2\unit\hbox{$ 

\scriptstyle\bullet\ ; (#1 ,#2) $}}} 

If the \point specifications are not enclosed in an \hbox — i.e. , if they occur in vertical 
mode — a similar construction can be used. In this case \point should create a box 
whose height and depth are zero: 

\def\point#l #2 {\vbox to0pt{\kern-#2\unit 

\hbox{\kern#l\unit$\scriptstyle\bullet\ ; (#1 , #2)$}\vss} 
\nointerlineskip} 

(The \nointerlineskip is necessary to prevent interline glue from messing things up.) 

If you enjoy fooling around making pictures, instead of typesetting ordinary 
text, TjgX will be a source of endless frustration/amusement for you, because almost 
anything is possible if you have suitable fonts. For example, suppose you have a font 
\qc that contains four quarter circles: 


• ( 0 , 8 ) 

• ( 4 , 7 ) 

• ( 1 , 1 - 5 ) 

• ( 4 , 2 ) 

• ( 0 , 0 ) 


• (- 1 ,- 2 . 5 ) 


• ( 0 ,- 8 ) 



a = ^ b = J c=V d= r 

Each of these characters has the same height, the same width, and the same depth; the 
width and the height-plus-depth are equal to the diameter of the corresponding full 
circle. Furthermore, the reference point of each character is in a somewhat peculiar 


Ursa Major 

points with arbitrary coordinates 

coordinates 

kern 

hbox 

nointerlineskip 
quarter circles 
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place: Each quarter arc has a horizontal endpoint such that the lower edge of the curve 
is at the baseline, and a vertical endpoint such that the left edge is directly above 
or below the reference point. This convention makes it possible to guarantee perfect 
alignment between these characters and rules that meet them at the endpoints; the 
thickness of such rules should be \f ontdimen8\qc. 

Given those characters, it’s possible to devise macros \path, \L, \R, \S, and \T 
such that \path{(any string of \L’s, \R’s, \S’s, and XT’s}} produces a path that starts 
traveling East, but it turns left for each \L, right for each \R, goes straight for each \S, 
and turns backward for each \T. Thus, for example, \path{\L\T\S\T\R\L\T\S\T\R} 
yields ‘++’’ and you can also get the following effects: 

\path{\L\R\S\R\S\R\S\S\R\R} 



\path{\R\R\R\R\T\S\S\L\L\L\L\L\S\S} 


P 


\def\X{\L\T\L\L\T\L\L\T} \path{\X\X\X\X} 



Furthermore, there are operations \B and \W that make the path black (visible) and 
white (invisible), respectively: 


\path{\R\R\S 

\W\S\S\S\R\R 

\B\R\R\S\R\S\R\S\S\S\R\S\S\S\S\S\R\S\R 

\W\R\R\R\S\L\S 

\B\L\S\S\S\S} 



_) 


fontdimen 

turtle commands 

path 

catcode 

endlinechar 

ifcase 


(It may be necessary to put kerns before and after the path, since the box produced 
by Xpath may not be as wide as the actual path itself.) 

The Xpath macros work differently from Xpoint, since the boxes need not have 
zero width in this application: 

Xcatcode'X =9 \endlinechar=-l '/, ignore all spaces (temporarily) 
\newcount\dir \newdimen\y \newdimen\w 

Xnewif Xifvisible \let\B=\visibletrue \let\W=\visiblefalse 
\newbox\NE \newbox\NW \newbox\SE \newbox\SW \newbox\NS \newbox\EW 
\setbox\SW=\hbox{\qc a} \setbox\HW=\hbox{\qc b} 

\setbox\NE=\hbox{\qc c} \setbox\SE=\hbox{\qc d} 

\w=\wd\SW \dimen0=\f ontdimen8\qc 

\setbox\EW=\hbox{\kern-\dp\SW \vrule heightXdimenO width\wd\SW} \wd\EW=\w 
\setbox\NS=\hbox{\vrule height\ht\SW depth\dp\SW widthXdimenO} \wd\NS=\w 
\def \L{\if caseXdir \dy+\NW \or\dx-\SW \or\dy-\SE \or\dx+\NE\dd-4\f i \dd+l} 
Xdef \S{\if caseXdir \dx+\EW \or \dy+\NS \or \dx-\EW \or \dy-\NS \fi} 
\def\R{\if caseXdir \dy-\SW\dd+4 \or\dx+\SE \or\dy+\NE \or\dx-\NW\f i \dd-l} 
Xdef \T{\ if caseXdir \kern-\w\dd+2\or\ey-\dd+2\or\kern\w\dd-2\or\ey+\dd-2\fi} 
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\edef\dd#l#2{\global\advance\dir#l#2\space} 

\def \dx#l#2{\ifvisible\raise\y\copy#2 \if #l-\kern-2\w\f i\else\kern#l\w\f i} 
\def \dy#l#2{\ifvisible\raise\y\copy#2 \kern-\w \fi \global\advance\y#l\w} 
\def \ey#l{\global\advance\y#l\w} 

\def \path#l{\hbox{\B \dir=0 \y=Opt #1}} 

\catcode‘\ =10 \endlinechar=‘\~~M '/, resume normal spacing conventions 

\newcount\n '/, the current order in the \dragon and \nogard macros 
\def \dragon{\ifnum\n>0{\advance\n-l \dragon\L\nogard}\f i} 

\def \nogard{\ifnum\n>0{\advance\n-l \dragon\R\nogard}\f i} 

(The last three lines are not part of the \path macros, but they can be used as an 
interesting test case. To get the famous “dragon curve” of order 9, all you have to say 
is ‘\path{\dir=3 \n=9 \dragon}’.) 

Let’s turn now to another box-oriented problem. The Misting macro dis- 
cussed earlier in this appendix was restricted to listing files that contain only visible 
ASCII characters. Sometimes it’s desirable to deal with ASCII (tab) marks too, where 
a (tab) is equivalent to 1 or 2 or • • • or 8 spaces (whatever is necessary to make the 
current line length a multiple of 8). How can this be done? 

We shall assume that files can contain a special symbol that TfrX will input 
as character number 9, the ASCII (tab) code; some implementations can’t actually do 
this. If a file contains the three symbols “I, plain T)gX will normally input them as a 
single character, number 9; but in a verbatim listing of the file we naturally want such 
symbols to print as themselves, i.e., as “I. 

The following construction redefines \setupverbatim so that the previous 
Misting macro will work with (tab) characters. The idea is to keep the line-so-far 
in an libox, which can be “measured” in order to find out how many characters have 
appeared since the beginning of the line or since the most recent (tab). 

\def\setupverbatim{\tt \lineno=0 

\def\par{\leavevmode\egroup\boxO\endgraf } 

\obeylines \uncatcodespecials \obeyspaces 
\ cat code ‘ \ ‘ =\active \catcode ‘ W~I=\active 
\everypar{\advance\lineno byl 

\llap{\sevenrm\the\lineno\ \ }\startbox}} 

\newdimen\w \setboxO=\hbox{\tt\space} \w=8\wd0 '/, tab amount 
\def\startbox{\setboxO=\hbox\bgroup} 

{\catcode ‘\"~I=\active 

\gdef * * I{\leavevmode\egroup 

\dimen0=\wd0 '/, the width so far, or since the previous tab 
\divide\dimenO by\w 

\multiply\dimenO by\w '/, compute previous multiple of \w 
\advance\dimenO by\w '/, advance to next multiple of \w 
\wd0=\dimen0 \boxO \startbox}} 

(The new things in \setupverbatim are the ‘\egroup\boxO’ in the redefinition of \par; 
the ‘\catcode‘\“"I=\active’; and the ‘\startbox’ in \everypar.) The \settabs 
and \+ macros of Appendix B provide another example of how tab operations can be 
simulated by boxing and unboxing. 
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Chapter 22 explains how to put vertical rules in tables by considering the rules 
to be separate columns. There’s also another way, provided that the rules extend all 
the way from the top of the table to the bottom. For example, 

\beginvrulealign 

\tabskip=10pt 


\halign{&\strut#\hf il\cr 


These 

after 

These& after\cr 


vertical 

the 

vertical^ the\cr 

yields 

rules 

alignment 

rulesft alignment\cr 


were 

was 

werefe was\cr 


inserted 

completed! 


inserted^ completed ! \cr} 

\endvrulealign 

The magic macros in this case examine the bottom row of the alignment, which consists 
of alternating tabskip glue and boxes; each item of tabskip glue in that bottom row 
will be bisected by a vertical rule. Here’s how: 

\def \beginvrulealign{\setboxO=\vbox\bgroup} 

\def \endvrulealign{\egroup '/, now \boxO holds the entire alignment 
\setbox0=\vbox{\setbox2=\hbox{\vrule height\htO depth\dpO widthOpt} 
\unvboxO \setboxO=\lastbox '/, now \boxO is the bottom row 
\nointerlineskip \copyO */, put it back 

\global\setboxl=\hbox{} '/, initialize box that will contain rules 
\setbox4=\hbox{\unhbox0 '/, now open up the bottom row 
\loop \skipO=\lastskip \unskip '/, remove tabskip glue 
\advance\skipO by-.4pt */, rules are .4pt wide 
\divide\skipO by 2 

\global\setboxl=\hbox{\hskip\skipO\vrule\hskip\skipO 

\unhbox2\unhboxl}*/, 

\setbox2=\lastbox '/, remove alignment entry 
\ifhbox2 \setbox2=\hbox{\kern\wd2}\repeat}}*/, 
\hbox{\rlap{\boxO}\boxl}} '/, superimpose the alignment on the rules 

This method works with all alignments created by \halign{ . . . }. For alignments 
created by, say, \halign tolOOptf. . the method works only if the bottom row of 
the alignment contains all of the columns, and only if ‘\boxl’ is replaced by ‘\hbox 
tol00pt{\unhboxl}’ at the end of \endvrulealign. 

7. Paragraph maneuvers. Chapter 14 promised that Appendix D would present an ex- 
ample where ragged right and ragged left setting occur in the same paragraph. The 
following interesting example was suggested by the “Key Index” in Mathematical Re- 
views, where the entries consist of a possibly long title followed by dot leaders followed 
by a possibly long list of review numbers. If the title doesn’t fit on one line, it should 
be set ragged right, with hanging indentation on all lines after the first; if the references 
don’t all fit on one line, they should be set ragged left. For example, given the input 

ACM Symposium on Theory of Computing, Eighth Annual (Hershey, '/, 
Pa., 1976) \ : 1879 , 4813, 5414, 6918, 6936, 6937, 6946, 6951, ’/. 

6970, 7619, 9605, 10148, 11676, 11687, 11692, 11710, 13869 


rules in tables 

alignment 

tabskip glue 

unvbox 

lastskip 

unskip 

lastbox 

halign 

ragged right 

ragged left 

Key Index 

Mathematical Reviews 
index macros 
leaders 
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the following three types of output are 

ACM Symposium on 

Theory of Computing, 

Eighth Annual 
(Hershey, Pa., 1976) 

1879, 4813, 5414, 

6918, 6936, 6937, 6946, 
6951, 6970, 7619, 9605, 
10148, 11676, 11687, 
11692, 11710, 13869 


desired, depending on the column width: 


ACM Symposium on Theory of 
Computing, Eighth Annual 

(Hershey, Pa., 1976) 

1879, 4813, 5414, 6918, 6936, 6937, 
6946, 6951, 6970, 7619, 9605, 10148, 
11676, 11687, 11692, 11710, 13869 


ACM Symposium on Theory of Computing, Eighth Annual 

(Hershey, Pa., 1976) ... 1879, 4813, 5414, 6918, 6936, 6937, 6946, 
6951, 6970, 7619, 9605, 10148, 11676, 11087, 11692, 11710, 13869 


Notice that the dot leaders are treated in three different ways, depending on which 
works out best: They may occur at the left of the first line after the title, or they 
may appear at the end of the last line of the title (in which case they stop well be- 
fore the right margin), or they may occur in the middle of a line. Furthermore, the 
ragged-right lines are supposed to end at least 0.5 ern from the right margin. Our goal 
is to achieve all this as a special case of T^X’s general paragraphing method. The sim- 
ple approach of Appendix B won’t work, because \raggedright is achieved there by 
adjusting \rightskip; TgX uses the same \rightskip value in all lines of a paragraph. 

The solution to this problem requires an understanding of the line-breaking 
algorithm; it depends on how demerits are calculated, and on how items are removed 
at the breakpoints, so the reader should review Chapter 14 until those concepts are 
firmly understood. Basically, we need to specify a sequence of box/glue/penalty items 
for the spaces in the title portion, another sequence for the spaces in the reference 
portion, and another sequence for the dot leaders. In the title portion of each index 
entry, interword spaces can be represented by the sequence 

\penaltyl0000 \hskip.5em plus3em \penalty0 
\hskip-.17em plus-3em minus. Item 

Thus, there is a stretchability of 3 em if a line break occurs at the \penaltyO; otherwise 
the net interword space will be .33 em, shrinkable to .22 em. This gives ragged right 
margins. The interword spaces in the reference portion are designed to produce ragged 
left margins and to minimize the number of lines devoted to references: 

\penaltyl000 \hskip.33em plus-3em minus. llem 
\vadjust{}\penalty 10000 \hskip0pt plus3em 

The \vadjust{} does nothing, but it doesn’t disappear at a line break. Thus, if a break 
occurs at the \penaltyl000, the following line will begin with stretchability 3em; but 
if no break occurs, the net space will be .33 em minus .llem. Finally, the transition 
between title and references can be specified by 

\penaltyl0000 \hskip.5em plus3em \penalty600 
\hskip-.17em plus-3em minus. llem 
\vadjust-Q\penalty 10000 

\leaders\copy\dbox\hskip3 . 3\wd\dbox pluslfil minus . 3\wd\dbox 
\kern3em \penalty600 \hskip-2 . 67em plus-3em minus. llem 
\vadjust{}\penalty 10000 \hskip0pt plus3em 
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(Quite a mouthful.) This long sequence of penalty and glue items begins rather like 
the interword spaces in the first part, and it ends rather like the interword spaces in 
the last part. It has two permissible breakpoints, namely at the ‘\penalty600’ items. 
The first breakpoint causes the leaders to appear at the beginning of a line; the second 
causes them to appear at the end, but 3 ems away. The leader width will always be 
at least three times the width of \dbox, so at least two copies of \dbox will always 
appear. Here is the actual IgX code that can be used to set up the desired behavior: 

\hyphenpenalty 10000 \exhyphenpenaltyl0000 \pretolerancel0000 '/, no hyphens 
\newbox\dbox \setbox\dbox=\hbox to . 4em{\hss . \hss} */, dot box for leaders 
\newskip\rrskipb \rrskipb= . 5em plus3em '/, ragged right space before break 
\newskip\rrskipa \rrskipa=- . 17em plus-3em minus, llem '/, ditto, after 
\newskip\rlskipa \rlskipa=0pt plus3em '/, ragged left space after break 
\newskip\rlskipb \rlskipb= . 33em plus-3em minus .llem '/, ditto, before 
\newskip\lskip \lskip=3 . 3\wd\dbox pluslfil minus . 3\wd\dbox '/, for leaders 
\newskip\lskipa \lskipa=-2 . 67em plus-3em minus, llem */, after leaders 
\mathchardef \rlpen=1000 \mathchardef\leadpen=600 */, constants used 
\def \rrspace{\nobreak\hskip\rrskipb\penaltyO\hskip\rrskipa} 

\def \rlspace{\penalty\rlpen\hskip\rlskipb\vadjust{}\nobreak\hskip\rlskipa} 
\uccode‘~=‘ \uppercase{ 

\def \ : {\nobreak\hskip\rrskipb \penalty\leadpen \hskip\rrskipa 
\vadjust-Q\nobreak\leaders\copy\dbox\hskip\lskip 
\kern3em \penalty\leadpen \hskip\lskipa 
\vadjust{}\nobreak\hskip\rlskipa \let~=\rl space} 
\everypar{\hangindent=l . 5em \hangafter=l \let~=\rrspace}} 

\uccode‘~=0 \parindent=0pt \parf illskip=0pt \obeyspaces 

Putting the interword glue into \skip registers saves a great deal of time and memory 
space when T}rX works with such paragraphs; ‘\hskip(explicit glue)’ occupies six cells 
of TgK’s box memory, but ‘\hskip(skip register)’ occupies only two. Notice the tricky 
use of \uppercase here to convert "13 into u is; “random” active characters can be 
obtained in a similar way. 

Let’s turn now to a much simpler problem: hanging punctuation. 

“What is hanging punctuation?” asked Alice, margins, if they occur next to a line break.” 
with a puzzled frown. ‘Well, y’know, actually,’ ‘Yeah, I guess.’ “Really! But why do all your 
answered Bill, ‘I’d rather demonstrate it than remarks have single quotes, while mine are 
explain it.’ “Oh, now I see. Commas, periods, double?” ‘I haven’t the foggiest; it’s weird, 
and quotes are allowed to stick out into the Ask the author of this crazy book.’ 

Each comma in Alice and Bill’s demonstration paragraph was represented inside of TjvX 
by the sequence of three items ‘ ,\kern-\commahang\kern\commahang’, and there were 
similar replacements for periods and for closing quotes; opening quotes were represented 
by the longer sequence 

\kern\qquotehang\vadjust{}\kern-\qquotehang‘ ‘ \allowhyphens 

where \allowhyphens allows the following word to be hyphenated. This construction 
works because kerns disappear into line breaks in the proper way; the relevant rules 
from Chapter 14 are: (1) A line break can occur at a kern that is immediately followed 
by glue. (2) Consecutive glue, kern, and penalty items disappear at a break. 
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To set T^X up for hanging punctuation, you can say 

\newdimen\commahang \setboxO=\hbox{ ,} \commahang=\wdO 

\newdimen\periodhang \setboxO=\hbox{ . } \periodhang=\wdO 

\newdimen\quotehang \setboxO=\hbox{ ‘} \quotehang=\wdO 

\newdimen\qquotehang \setboxO=\hbox{ ‘ ‘ } \qquotehang=\wdO 

\newskip\zzz \def\allowhyphens{\nobreak\hskip\zzz} 

\def\lqq{‘‘} \def\rqq{’’} \def\pnt{.} 

\def \comma{ , \kern-\coiranahang\kern\commahang} 

\def \period{ . \kern-\periodhang\kern\periodhang} 

\def \rquote{ ’\kern-\quotehang\kern\quotehang} 

\def \lquote{\ifhmode\kern\quotehang\vadjust{}\else\leavevmode\f i 
\kern-\quotehang ‘ \allowhyphens} 

\catcode ‘ , =\active \let , =\couima \catcode ‘ . =\active \let . =\period 
\catcode ‘ ’ =\active \def ’ {\futurelet\next\rqtest} 

\catcode ‘ ‘ =\active \def ‘ {\futurelet\next\lqtest} 

\def \rqtest{\ifx\next ’ \let\next=\rquotes\else\let\next=\rquote\f i\next} 
\def \lqtest{\ifx\next ‘ Met\next=\lquotes\else\let\next=\lquote\f i\next} 
\def \rquotes ’ {\rqq\kern-\qquotehang\kern\qquotehang} 

\def Mquotes ‘{\ifhmode\kern\qquotehang\vadjust{}\else\leavevmode\f i 
\ker n- \qquot ehang\ lqq\ al lo why phens } 

Notice that the macros need to do their own checking for ligatures, and they also take 
appropriate actions when a paragraph begins with an opening quote. Since \kern 
does not affect the space factor, hanging punctuation doesn’t affect TgX’s spacing 
conventions within a line. Partially hanging punctuation can be obtained by decreasing 
the amounts of \commahang, etc. The macros \pnt, \lq, and \rq should be used 
in constants; for example, a dimension of 6.5 in must be written ‘6\pnt5in’ when 
hanging punctuation is in effect, and ‘\catcode\lq, =12’ makes commas inactive again. 
A special font with zero-width \hyphenchar should be used for “hanging hyphenation.” 

And now for our next trick, let’s consider an application to short footnotes. 
The footnotes at the bottom of this page 1 ’ 2 ’ 3 ’ 4 5 ’ 6 ’ 6 ’ 7 ’ 8 ’ 9 * 10 look funny, because most of 
them are quite short. When a document has lots of footnotes, and when most of them 
take up only a small part of a line, the output routine ought to reformat them in some 
more appropriate way. 


space factor 
pnt 

lq 

rq 

hyphenchar 
footnotes, short 


1 First footnote. 

2 Second footnote. (Every once in a while a long footnote might occur, just to make 
things difficult.) 

3 Third footnote. 

4 Fourth footnote. 

5 Fifth footnote. (This is incredibly boring, but it’s just an example.) 

6 Another. 

7 And another. 

8 Ho hum. 

9 Umpteenth footnote. 

10 Oodles of them. 
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For example, one approach would be to typeset the footnotes in narrow 
columns and to put, say, three columns of footnotes at the bottom of each page. The 
ten example footnotes might then look like this: 


1 First footnote. 

2 Second footnote. (Every 
once in a while a long 
footnote might occur, 
just to make things 
difficult.) 


3 Third footnote. 

4 Fourth footnote. 

5 Fifth footnote. (This is 
incredibly boring, but it’s 
just an example.) 


6 Another. 

7 And another. 

8 Ho hum. 

9 Umpteenth footnote. 
10 Oodles of them. 


In this case, the footnotes could be generated by 

\insert\f ootins{\eightpoint \hsize=9pc \parindent=lpc 
\leftskip=Opt \raggedright \pretolerance=10000 
\hyphenpenalty= 10000 \exhyphenpenalty= 10000 
\interlinepenalty=\interf ootnotelinepenalty 
\floatingpenalty=20000 

\splittopskip=\ht\strutbox \splitmaxdepth=\dp\strutbox 
\item{$~{\the\f ootno}$}\strut(text of footnote) \ strut 
\par\allowbreak} 


and \count\f ootins would be set to 333 so that each footnote line would be considered 
to occupy about one third of a line on the page. The output routine would then see a 
\box\footins that looks like this: 


\vbox (142 . 0+2 . 0)xl08. 0 

. \hbox(7. 0+2 . 0)xl08 . 0 , glue set 42.23425fil [] 

. \penalty 0 

. \hbox(7. 0+2 . 0)xl08 . 0 , glue set 0.29266 [] 

Apenalty 250 

. \glue (\baselineskip) 1.44444 

. \hbox (5 . 55556+1 . 55556) x96. 0, glue set 0.8693, shifted 12.0 [] 

. \penalty 100 

. \glue (\baselineskip) 1.88889 

. \hbox (5 . 55556+1 . 55556) x96. 0, glue set 0.92438, shifted 12.0 [] 


raggedright 

strut 

allowbreak 

balancing 


. \hbox(7. 0+2 . 0)xl08 . 0 , glue set 18.56308fil [] 

. \penalty 0 

. \hbox(7. 0+2 . 0)xl08 . 0 , glue set 36.92476fil [] 

. \penalty 0 

The individual footnotes each end with Apenalty O’; footnotes that take up more than 
one line have larger penalties between the lines, and interline glue appears there too. 

How should the output routine break such a box up into three roughly equal 
pieces? Notice that the contents of the box are completely rigid, i.e, there is no glue 
that can stretch or shrink. Furthermore, we can assume that the contents of the box are 
regular, i.e., that the inter-baseline distances are all the same. In such circumstances 
a fairly simple balancing routine can be used to trisect the box. 

Let’s consider a more general problem: Suppose that a rigid vbox is given, 
n lines tall, where adjacent baselines are b units apart. Suppose also that the top 
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baseline is h units from the top of the vbox, where 0 < h < b. (In our footnote 
example, b = 9 pt and h = 7 pt; in the standard settings of plain TjgX, 6 = 12 pt and 
h = 10 pt. We might as well work the problem for general 6 and h .) It follows that 
the height of the vbox is H = h + b(n — 1) = bn + h — b. 

If n lines are to be distributed evenly into k columns, the first column should 
contain \n/k~\ lines. (This denotes the smallest integer greater than or equal to n/k.) 
For example, our application to footnotes has n = 16 and k = 3, hence the first column 
should contain 6 lines. After forming the first column, we have reduced the problem 
to n = 10 and k = 2, so two 5-line columns will complete the operation. (Notice that 
it is better to divide 16 into 6 + 5 + 5 instead of 6 + 6 + 4.) Once we have found the 
first column, it’s always possible to reduce the 6-column problem to a (k — l)-column 
problem, so we need only concentrate on finding the first column. 

Let m = \n/k ]. The height of the given box is bn + 6—6, and the height of 
the first column should be bm + 6 — 6; hence we want to do a \vsplit to that height. 
However, it isn’t necessary to calculate bm + 6 — 6 exactly, since a bit of arithmetic 
proves that 

, , , bn + h — b , , . „ , 

bm + 6—6 < F 6 < 6(ni + 1) + 6 — 6. 

k 

Therefore it suffices to \vsplit to height H' = H/k + 6; under the assumptions of 
rigidity, and assuming that a valid break is possible after each line, \vsplit to H' will 
split after the maximum number of lines that yield a box of height < H' . (We have 
observed that m lines produce a box of height < H' while m + 1 lines produce a box 
of height > H' .) The following TjrX code does this: 


vsplit 

divide 

splittopskip 

hfil 

hfilneg 

valign 

recursive macro 

vbadness 

underfull 


\newcount\k \newdimen\h '/, registers used by the \rigidbalance routine 
\def \rigidbalance#l#2 #3 {\setboxO=\box#l \k=#2 \h=#3 
\line{\splittopskip=\h \vbadness=10000 \hfilneg 
\valign{##\vf il\cr\do splits}}} 

\def \dosplits{\ifnum\k>0 \noalign{\hf il}\splitoff 
\global\advance\k-l\cr\dosplits\f i} 

\def \splitoff{\dimenO=\htO 

\divide\dimenO by\k \advance\dimenO by\h 
\vsplitO to \dimenO } 


This code is interesting on a number of counts. First, notice that the calculation does 
not depend on 6, only on 6 and the height of the given box; hence \rigidbalance has 
three parameters: a box register number, the number of columns k, and the top baseline 
height 6. The routine splits the given vbox into k nearly equal pieces and justifies the 
result in a Mine. The value of \splittopskip is set to 6 so that subsequent vboxes 
will satisfy the ground rules of the original vbox, as the problem is reduced from k 
to k — 1. Each column will be preceded by \hf il, hence \hfilneg is used to cancel the 
\hfil before the first column. A \valign is used to align all of the columns at the top. 
Notice that the preamble to this \valign is quite simple; and the body of the \valign 
is generated by a recursive macro \dosplits that produces the k columns. The value 
of \vbadness is set to 10000 because each \vsplit operation will produce an underfull 
vbox whose badness is 10000. 
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In our application to footnotes, the \output routine can reformat the contents 
of \box\footins by saying, for example, 

\rigidbalance\f ootins 3 7pt 
\setbox\f ootins=\lastbox 

since \lastbox will be the result of \rigidbalance. 

This solution to the problem of short footnotes might result in club lines or 
widow lines, since the balancing routine we have described simply trisects the total 
number of lines. For example, if the tenth footnote of our example had not been 
present, the fifteen remaining lines would have been split 5 + 5 + 5; the second column 
would have been headed by the lonely word ‘difficult.)’, and the third column would have 
started with ‘just an example.)’. The rigid balancing procedure could be replaced by one 
that allows ragged-bottom columns, but there’s also another approach: The entire set 
of footnotes could be combined into a single paragraph, with generous spacing between 
the individual items. For example, the ten footnotes we have been considering might 
appear as follows: 

1 First footnote. 2 Second footnote. (Every once in a while a long footnote might occur, just 
to make things difficult.) 3 Third footnote. 4 Fourth footnote. “Fifth footnote. (This 
is incredibly boring, but it’s just an example.) 6 Another. 7 And another. 8 Ho hum. 
9 Umpteenth footnote. 10 Oodles of them. 

It would be possible to take the contents of \box\footins shown previously 
and to reformat everything into a paragraph, but such an operation would be needlessly 
complicated. If footnotes are to be paragraphed by the output routine, it’s better simply 
to prepare them in unjustified lrboxes. Each of these hboxes will be unboxed later, so 
we are free to play with their heights, widths, and depths. It’s convenient to set the 
depth to zero and the height to an estimate of how much a particular footnote will 
contribute to the final paragraph. For example, if a footnote takes up exactly half of 
the \hsize, and if the final footnote is going to be set with \baselineskip=10pt, then 
the height of the footnote hbox should be set to 5 pt. By letting \count\footins=1000, 
we’ll have a pretty good estimate of the size of the final footnote paragraph. In other 
words, the following insertion scheme is suggested: 

\ insert \footins{\floatingpenalty =20000 
\eightpoint \setboxO=\hbox{'/, 

$~{\the\footno}${text of footnote) \penalty-10\hskip\footglue} 
\dp0=0pt \htO=\fudgef actor\wd0 \boxO} 

The penalty of —10 tends to favor line breaks between footnotes; \footglue is the 
amount of glue between footnotes in the final footnote paragraph; and \fudgefactor 
is the ratio of \baselineskip to \hsize in that paragraph. The author defined the 
necessary quantities as follows in his experiments: 

\eightpoint \newskip\f ootglue \f ootglue=l . 5em plus.3em minus. 3em 
\newdimen\f ootnotebaselineskip \footnotebaselineskip=10pt 
\dimenO=\footnotebaselineskip \multiply\dimenO by 1024 
\divide \dimen0 by \hsize \multiply\dimenO by 64 
\xdef \fudgefactor{\expandafter\getf actor\the\dimenO } 

(The computation of \fudgef actor uses the fact that lpt = 1024 x 64 sp, and it 
assumes that the \f ootnotebaselineskip is less than 16 pt.) 


lastbox 

club lines 

widow lines 

ragged-bottom 

multiply 

divide 

sp 
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Inside the output routine, \box\footins will now be a vbox of liboxes, and 
the height of this vbox will be an estimate of the height of the final paragraph. For 
example, our ten footnotes produce 

\vbox (34 . 48158+0 . 0) x386 . 422 1 
,\hbox(2. 00175+0. 0)x70. 68285 [] 

,\hbox(10. 94359+0. 0)x386. 4221 [] 

,\hbox(2. 09749+0. 0)x74. 06345 [] 

,\hbox(2. 2077+0. 0)x77. 95517 [] 

,\hbox(7. 6296+0. 0)x269. 40376 [] 

. \hbox( 1.4085 1+0. 0)x49. 73532 [] 

,\hbox(l. 87659+0. 0)x66. 26334 [] 

Ahbox(l. 38826+0. 0)x49. 02003 [] 

Ahbox(2. 67213+0. 0)x94. 35402 [] 

Ahbox(2. 25597+0. 0)x79. 65926 [] 

and the height of 34.48158 pt corresponds to an estimate of about three and a half 
lines. (TpX’s page builder has also added \skip\f ootins when estimating the total 
contribution due to footnotes.) 

The reformatting of \box\f ootins takes place in three stages. First the vbox 
of liboxes is changed to an hbox of liboxes, so that we obtain, e.g., 

\hbox (10. 94359+0. 0)xl217. 5593 
Ahbox(2. 00175+0. 0)x70. 68285 [] 

Ahbox(2. 25597+0. 0)x79. 65926 [] 

(the same contents as before, but strung in a horizontal row instead of a vertical 
column). Then the inner liboxes are unboxed, and we obtain 

\hbox (6 . 68999+2 . 0)xl217 . 5593 
. \mathon 

Ahbox(3. 86665+0. 0)x4. 16661, shifted -2.82333 [] 

. \mathoff 
Aeightrm F 
. etc . 

Finally the outer hbox is unboxed, and the horizontal list inside it is converted into a 
paragraph. Here is the actual TpX code: 

\def \makef ootnoteparagraph{\unvbox\f ootins \makehboxofhboxes 
\setbox0=\hbox{\unhbox0 \removehboxes} 

\baselineskip=\f ootnotebaselineskip\noindent\unhboxO\par} 

\def \makehboxofhboxes{\setboxO=\hbox{} 

\loop\setbox2=\lastbox \ifhbox2 \setbox0=\hbox{\box2\unhbox0}\repeat} 
\def \removehboxes{\setboxO=\lastbox 
\ifhbox0{\removehboxes}\unhbox0 \f i} 

The \removehboxes operation is especially noteworthy, because it uses TpX’s save 
stack to hold all of the liboxes before unboxing them. Each level of recursion in this 
routine uses one cell of input stack space and three cells of save stack space; thus, 


unhbox 
unvbox 
save stack 
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it is generally safe to do more than 100 footnotes without exceeding TjjX’s capacity. 
The \makehboxofhboxes routine is not as efficient; T)gX doesn’t allow a vbox to be 
unboxed in horizontal mode, or vice versa, hence the trick of \removehboxes cannot be 
used. This means that the running time is proportional to n 2 , if there are n footnotes, 
because the time to make or unmake a box is proportional to the number of items in 
the top-level list inside. However, the constant of proportionality is small, so there is 
no need to resort to a more complicated scheme that would be asymptotically faster. 
Indeed, the \lastbox operation itself has a running time approximately equal to a + mb, 
where m is the number of items on the list preceding the box that is removed; hence 
\removehboxes lias a running time of order n 2 as well. But the constant b is so small 
that for practical purposes it’s possible to think of \lastbox as almost instantaneous. 
Note, however, that it would be a mistake to bypass the \removehboxes operation by 
saying ‘\setbox0=\hbox{\unhbox2\unhbox0}’ in \makehboxofhboxes; that would make 
the top-level list inside \boxO too long for efficient unboxing. 

8. Communication with output routines. It would be possible to write an entire book 
about T[rX output routines; but the present appendix is already too long, so it will 
suffice to mention only one or two sneaky tricks that a person might not readily think of. 
(Appendix E gives some less sneaky examples.) 

Sometimes an output routine needs to know why it was invoked, so there’s 
a problem of communicating information from the rest of the program. T)gX provides 
general \mark operations, but marks don’t always yield the right sorts of clues. Then 
there’s \outputpenalty, which can be tested to see what penalty occurred at a break- 
point; any penalty of —10000, —10001, —10002, or less, forces the output routine to 
act, hence different penalty values can be used to pass different messages. (When 
the output routine puts material back on the list of contributions, it need not restore 
the penalty at the breakpoint.) If output has been forced by a highly negative value 
of \outputpenalty, the output routine can use \vbox{\unvcopy255} to discover how 
full the page-so-far actually is. Underfull and overfull boxes are not reported when 
\box255 is packaged for use by the output routine, so there’s no harm in ejecting a 
page prematurely if you want to pass a signal. (Set \holdinginserts positive to pass 
a signal when the contents of \box255 will be sent back through the page builder again, 
if any insertions are present.) 

Perhaps the dirtiest trick of all is to communicate with the output routine via 
the depth of \box255. For example, suppose that you want to know whether or not 
the current page ends with the last line of a paragraph. If each paragraph ends with 
‘\specialstrut’, where \specialstrut is like \strut but 1 sp deeper, then \dp255 will 
have a recognizable value if a page ends simultaneously with a paragraph. (Of course, 
\maxdepth must be suitably large; plain TjjX takes \maxdepth=4pt, while struts are 
normally 3.5 pt deep, so there’s no problem.) A distance of 1000 sp is invisible to the 
naked eye, so a variety of messages can be passed in this way. 

If the value of \vsize is very small, TfrX will construct paragraphs as usual 
but it will send them to the output routine one line at a time. In this way the output 
routine could attach marginal notes, etc., based on what occurs in the line. Paragraphs 
that have been rebuilt in this way can also be sent back from the output routine to the 
page builder; normal page breaks will then be found, if \vsize has been restored. 

An output routine can also write notes on a file, based on what occurs in a 
manuscript. A two-pass system can be devised where TjrX simply gathers information 
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during the first pass; the actual typesetting can be done during the second pass, using 
\read to recover information that was written during the first. 

9. Syntax checking. Suppose you want to run a manuscript through Tj^X just to check 
for errors, without getting any output. Is there a way to make Tj^X run significantly 
faster while doing this? Yes; here’s how: (1) Say ‘\f ont\duramy=dummy’ ; your system 
should include a file dummy .tfm that defines a font with no characters (but with enough 
\fontdimen parameters to qualify as a math symbol font). (2) Set all the font identifiers 
you are using equal to \dummy. For example, \let\tenrm=\dummy, \let\tenbf=\dummy, 
. . . , \textfontO=\dummy, etc. (3) Say ‘\dummy’ to select the dummy font (since plain 
TjrjX may have selected the real \tenrm). (4) Set \tracinglostchars=0, so that TjjX 
won’t complain when characters aren’t present in the dummy font. (5) Set 

\output={\setbox0=\box255\deadcycles=0} 

so that nothing will be shipped out, yet TjgX will not think that your output routine 
is flaky. (6) Say \newtoks\output, so that no other output routine will be defined. 

(7) Say \frenchspacing so that TjgX will not have to do space factor calculations. 

(8) Say \hbadness=10000 so that underfull boxes will not be reported. (9) And if you 
want to disable \write commands, use the following trick due to Frank Yellin: 

\let\immediate=\relax \def \write#l#{{\afterassignment}\toksO=} 

These changes usually make TgX run more than four times as fast. 


dummy 

tracinglostchars 

deadcycles 

newtoks 

frenchspacing 

hbadness 

Yellin 

afterassignment 

Wolfe 

Hombert, Humbert 

STOUT 

Watson 

Holmes 

DOYLE 


Wolfe, who had moved around the desk and into his chair, 
put up a palm at him: “ Please , Mr. Hombert. 

I think it is always advisable to take a short-cut when it is feasible . " 

— REX STOUT, The Rubber Band (1936) 

“My dear Watson, try a little analysis yourself 
said he, with a touch of impatience. 

“ You know my methods. Apply them, 
and it will be instructive to compare results 
— CONAN DOYLE, The Sign of the Four (1890) 



Example Formats 



Appendix E: Example Formats 403 


Although the plain Tj^X format of Appendix B is oriented to technical reports, 
it can readily be adapted to quite different applications. Examples of three such 
adaptations are provided in this appendix: (1) a format for business letters; 
(2) a format for concert programs; (3) the format used to typeset this book. 

Let’s consider business letters first. Suppose that you want T^X to for- 
mat your correspondence, and that you have n letters to send. If your computer 
system contains a file letterformat.tex like the one described later in this 
appendix, it’s easy to do the job by applying T^jX to a file that looks like this: 

{optional magnification) 

\input letterformat 
(business letteri) 

{business letter,,) 

\end 

Here each of the n business letters has the form 

(letterhead) 

\address 

(one or more lines of address) 

\body 

(one or more paragraphs of text) 

\closing 

(one or more lines for salutation and signature) 

(optional annotations) 

(optional postscripts) 

\endletter 

\makelabel "/ omit this if you don’t want an address label 

The (letterhead) at the beginning of this construction is usually a control se- 
quence like \rjdletterhead for letters by R. J. D.; each letter writer can have 
a personalized letterhead that is stored with the letterformat macros. The 
(optional annotations) at the end are any number of one-line notes preceded by 
‘\annotations’; the (optional postscripts) are any number of paragraphs pre- 
ceded by ‘\ps’. When Tp.X is processing the \address and the \closing and the 
optional \annotations, it produces output line-for-line just as the lines appear 
in the input file; but when TgX is processing the \body of the letter and the 
optional \ps, it chooses line breaks and justifies lines as it normally does when 
typesetting paragraphs in books. 

A complete example, together with the resulting output, appears on 
the next two pages. This example starts with ‘\magnif ication=\magstepl’ 
because the letter is rather short. Magnification is usually omitted if the letters 
are long-winded; ‘\magnif ication=\magstephalf ’ is appropriate when they are 
medium-size. The same magnification applies to all n letters, so you must run 
Tf^X more than once if you want more than one magnification. 
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end 
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body 

closing 

endletter 
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\magnif ication=\magstepl 
\ input letterformat 

\rjdletterhead °/ t (see the output on the next page) 
\address 

Prof . ~Br ian~K . Reid 

Department of Electrical Engineering 
Stanford University 
Stanford, GA 94305 

\body 

Dear Prof.~Reid: 

I understand that you are having difficulties with 
Alka-Seltzer tablets. Since there are 25~pills 
per bottle, while the manufacturer’s directions 
recommend ‘ ‘plop,~plop, f izz , ~f izz , ’ ’ my colleagues 
tell me that you have accumulated a substantial 
number of bottles in which there is one tablet 
left. (See the 1978 SCRIBE User Manual, page 90.) 

At present I am engaged in research on the potential 
applications of isolated analgesics. If you would 
be so kind as to donate your Alka-Seltzer collection 
to our project, I would be more than happy to send 
you preprints of any progress reports that we may 
publish concerning this critical problem. 

\closing 

Sincerely, 

R. J. Drofnats 
Professor 

\annotations 

RJD/dek 

cc: {\sl The \TeX book} 

\ps 

P. S. \ If you like, I will check into the 
possibility that your donation and the meals that 
you have been eating might be tax-deductible, in 
connection with our research. 

\endletter 

\makelabel 
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If the letter is more than one page long, the addressee, date, and page 
number will appear at the top of subsequent pages. For example, the previous 
letter comes out as follows, if additional paragraphs are added to the text: 


(First page, reduced to 28.3%.) 


(Second page, reduced to 28.3%.) 


date 

today 

ifcase 

month 

or 

space 

number 

day 

year 

interlinepenalty 

raggedbottom 

hsize 

voffset 

vsize 

parindent 

parskip 

nopagenumbers 

headline 

pageno 

folio 


The macro package letterformat.tex that produces this format begins with 
a simple macro that expands to the current date. 

\def \today{\if case\month\or 

January\or February\or March\or April\or May\or June\or 
July\or August\or September\or October\or November\or December\fi 
\space\number\day , \number\year} 

Then comes the specification of page layout, which is “ragged” at the bottom. 
A rather large \interline penalty is used so that page breaks will tend to occur 
between paragraphs. 

\raggedbottom 
\interlinepenalty=1000 
\hsize=6 . 25truein 
\vof f set=24pt 

\advance\vsize by-\voffset 
\par indent =0pt 
\parskip=0pt 
\nopagenumbers 
\headline={\if num\pageno>l 

\tenrm To \addressee\hf il\today\hf il Page \folio 
\else\hf il\f i} 
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The contents of a letter are typeset either in “line mode” (obeying lines) 
or in “paragraph mode” (producing paragraphs in block style). Control sequences 
\beginlinemode and \beginparmode are defined to initiate these modes; and another 
control sequence, \endmode, is defined and redefined so that the current mode will 
terminate properly: 

\def \beginlinemode{\endmode 

\begingroup\obeylines\def\endmode{\par\endgroup}} 

\def \beginparmode{\endmode 

\begingroup\parskip=\medskipamount \def\endmode{\par\endgroup}} 
\let\endmode=\par 

\def \endletter{\endmode\vf ill\supere ject} 

One of the chief characteristics of this particular business letter format is a 
parameter called \longindentation, which is used to indent the closing material, the 
date, and certain aspects of the letterhead. The \address macro creates a box that 
will be used both in the letter and in the label on the envelope. If individual lines 
of the address exceed \longindentation, they are broken, and hanging indentation is 
used for any material that must be carried over. 

\newdimen\longindentation \longindentation=4truein 
\newbox\theaddress 

\def\address{\beginlinemode\get address} 

{\obeylines\gdef \getaddress #1 
#2 

{#l\gdef \addressee{#2}'/, 

\global\setbox\theaddress=\vbox\bgroup\raggedright'/, 
\hsize=\longindentation \everypar{\hangindent2em}#2 
\def\endmode{\egroup\endgroup \copy\theaddress \bigskip}}} 

(Parameter #2 to \getaddress will be the contents of the line following \address, i.e. , 
the name of the addressee.) 

The closing macros are careful not to allow a page break anywhere between 
the end of the \body and the beginning of a \ps. 

\def \body{\beginparmode} 

\def\closing{\beginlinemode\get closing} 

{\obeylines\gdef \getclosing #1 
#2 

{#l\nobreak\bigskip \leftskip=\longindentation #2 

\nobreak\bigskip\bigskip\bigskip */, space for signature 
\def 

{\endgraf \nobreak}}} 

\def \annotations{\beginlinemode\def\par{\endgraf\nobreak}\obeylines\par} 
\def\ps{\beginparmode\no break 

\interlinepenalty5000\def\par{\endgraf\penalty5000}} 

The remaining portion ofletterformat.tex deals with letterheads and labels, 
which of course will be different for different organizations. The following macros were 
used to generate the examples in this appendix; they can be modified in more-or-less 
obvious ways to produce suitable letterheads of other kinds. Special fonts are generally 
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needed, and they should be loaded at ‘true’ sizes so that they are not affected by 
magnification. One tiny refinement worth noting here is the \up macro, which raises 
brackets so that they look better in a telephone number. 

\def \up#l{\leavevmode \raise. 16ex\hbox{#l}} 

\f ontXsmallheadf ont=cmr8 at 8truept 
\f ont\largeheadf ont=cmdunhlO at 14.4truept 
\f ont\logofont=manfnt at 14.4truept 

\def \r jdletterhead{ 

\def \sendingaddress{R. J. DROFNATS, F.T.U.G.Xpar 
PROFESSOR OF FARM EC0L0GY\par 
TEX.RJD <§ SU-SCORE . ARPAXpar 
\up [415\up] Xthinspace 497-4975\par} 

\def\returnaddress{R. J. Drofnats, Dept. "of Farm Ecology Xpar 
The University of St . "Anf ordXpar 
P. 0. Box 1009, Haga Alto, CA 94321 USA} 

Xletterhead} 

\def \letterhead{\pageno=l \def \addressee{} Xunivletterhead 
{\leftskip=\longindentation 

{\baselineskip9truept\smallheadf ontXsendingaddress} 
\bigskip\bigskip\rm\today\bigskip}} 

\def \univletterhead{\vglue-\voff set 

\hbox{\hbox to\longindentation{\raise4truemm\hbox{\logofont 
\kern2truept X\kern-1 . 667truept 
\lower2truept\hbox{X}\kern-l . 667truept X}\hf il 
Xlargeheadf ont The University of St . "AnfordXhf il}'/, 
\kern-\longindentation 

\vbox{\smallheadf ont\baselineskip9truept 

\leftskip=\longindentation BOX 1009\par HAGA ALTO, CA 94321}} 
\vskip2truept\hrule\vskip4truept } 

\def \makelabel{\endletter\hbox{\vrule 
\vbox{\hrule \kern6truept 

\hbox{\kern6truept\vbox to 2truein{\hsize=\longindentation 
\smallheadfont\baselineskip9truept\returnaddress 
\vf illXmoveright 2truein\copy\theaddress\vf ill}'/, 
Xkern6truept}\kern6truept\hrule}\vrule} 

\pageno=0\vf illXeject} 


true 

up 

brackets 

telephone number 
leavevmode 
raise 
at 

TeX Users Group 
atsign 
makelabel 
concert programs 
music 

programs, for music 
sizes of type 
Bach 
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type size 

bigtype 

medtype 

smalltype 

size switching 


Our second example is a format for concert programs, to be used in connection 
with orchestra performances, recitals, and the like. We shall assume that the entire 
program fits on a single page, and that the copy is to be 4 inches wide. Comparatively 
large type (12 pt) will normally be used, but there is a provision for 10 pt and even 
8pt type in case the program includes pieces with a lot of subparts (e.g., Bach’s Mass 
in B minor, or Beethoven’s Diabelli Variations). To select the type size, a user says 
Xbigtype, Xmedtype, or Xsmalltype, respectively. These macros for size switching are 
comparatively simple because concert programs don’t require any mathematics; hence 
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the math fonts don’t need to be changed. On the other hand, the format does take 
sharp and flat, signs from the “math italic” font, which it calls ‘\mus’: 

\f ont\twelverm=cmrl2 
\f ont\twelvebf =cmbxl2 
\f ont\twelveit=cmtil2 
\f ont\twelvesl=cmsll2 
\font\twelvemus=cmmil2 
\f ont\eightrm=cmr8 
\f ont\eightbf=cmbx8 
\f ont\eightit=cmti8 
\f ont\eightsl=cmsl8 
\f ont\eightmus=cmmi8 

\def\bigtype{\let\rm=\twelverm \let\bf=\twelvebf 

\let\it=\twelveit \let\sl=\twelvesl \let\mus=\twelvemus 

\baselineskip=14pt minus lpt 

\rm} 

\def \medtype{\let\rm=\tenrm \let\bf=\tenbf 
\let\it=\tenit \let\sl=\tensl \let\mus=\teni 
\baselineskip=12pt minus lpt 
\rm} 

\def \smalltype{\let\rm=\eightrm \let\bf =\eightbf 

\let\it=\eightit \let\sl=\eightsl \let\mus=\eightmus 

\baselineskip=9. 5pt minus .75pt 

\rm} 

\hsize=4in 

\nopagenumbers 

\bigtype 

Notice the shrinkability in the \baselineskip settings. This would be undesirable in a 
book format, because different spacing between lines on different pages would look bad; 
but in a one-page document it helps squeeze the copy to fit the page, in an emergency. 
(There’s no need for stretcliability in the baselineskip here, because a \vfill will be 
used at the bottom of the page. ) 

Musical programs have a specialized vocabulary, and it is desirable to define 
a few control sequences for things that plain TpX doesn’t make as convenient as they 
could be for this particular application: 

\def\(#l){{\rm(}#l\/{\rm)}} 

\def \sharp{\raise . 4ex\hbox{\mus\char "5D}} 

\def \f lat{\raise . 2ex\hbox{\mus\char"5B}} 

\let\ , =\thinspace 

The \( macro produces roman parentheses in the midst of italicized text; the \sharp 
and \flat macros produce musical signs in the current type size. The \, macro makes 
it easy to specify the thin space that is used in constructions like ‘K. 550’ and ‘Op. 59’. 
(Plain T^rX has already defined \, and \sharp and \flat in a different way; but those 
definitions apply only to math formulas, so they aren’t relevant in this application.) 

Before discussing the rest of the music macros, let’s take a look at a complete 
example. The next two pages show the input and output for a typical concert program. 
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\input concert 

\tsaologo 

\medskip 

\centerline{Friday , November 19, 1982, 8:00 p.m.} 

\bigskip 

\centerline{\bf PROGRAM} 

\medskip 

\composition{Variations on a Theme by Tchaikovsky} 

\composer{Anton S. Arensky (1861 — 1906)} 

\smallskip 

{\medtype 

\movements{Tema: Moderato\cr 

Var."I: Un poco pi\‘u mosso&Var . "V: Andante\cr 
Var."II: Allegro non troppo&Var . "VI : Allegro con spirito\cr 
Var."III: Andantino tranquillo&Var . "VII : Andante con moto\cr 
Var."IV: Vivace&Coda: Moderato\cr} 

} 

\bigskip 

\composition{Concerto for Horn and Hardart , S.\,27} 

\composer{P. D. Q. Bach (1807—1742)?} 

\smallskip 

\movements{Allegro con brillo\cr 

Tema con variazione \(su una tema differente)\cr 
Menuetto con panna e zucchero\cr} 

\medskip 

\soloists{Ben Lee User, horn\cr 
Peter Schickele, hardart\cr} 

\bigskip 

\centerline{ INTERMISSION} 

\bigskip 

\composition{Symphony No.\,3 in E\flat\ Major\cr 
0p.\,55, ‘‘The Eroica’ ’\cr} 

\composer{Ludwig van Beethoven (1770 — 1827)} 

\smallskip 

\movements{Allegro con brio\cr 
Marcia funebre: Adagio assai\cr 
Scherzo: Allegro vivace\cr 
Finale: Allegro molto\cr} 

\bigskip 

\smalltype \noindent 

Members of the audience are kindly requested to turn off the 
alarms on their digital watches, and to cough only between movements. 

\bye 


Tchaikovsky, see Chai... 

Arenskii 

Bach, PDQ 

User 

Schickele 

Beethoven 
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THE ST. ANFORD ORCHESTRA 
R. J. Drofnats, Conductor 


Friday, November 19, 1982, 8:00 p.m. 


PROGRAM 

Variations on a Theme by Tchaikovsky 


Anton 

Tema: Moderato 
Var. I: Un poco pin mosso 
Var. II: Allegro non troppo 
Var. Ill: Andantino tranquillo 
Var. IV: Vivace 


. Arensky (1861-1906) 

Var. V: Andante 
Var. VI: Allegro con spirito 
Var. VII: Andante con moto 
Coda: Moderato 


Concerto for Horn and Hardart, S. 27 

P. D. Q. Bach (1807-1742)? 

Allegro con brillo 

Tema con variazione ( su una tema differente ) 
Menuetto con panna e zucchero 

Ben Lee User, horn 
Peter Schickele, hardart 

INTERMISSION 

Symphony No. 3 in Eb Major 
Op. 55, “The Eroica” 

Ludwig van Beethoven (1770-1827) 

Allegro con brio 
Marcia funebre: Adagio assai 
Scherzo: Allegro vivace 
Finale: Allegro molto 

Members of the audience are kindly requested to turn off the alarms on their 
digital watches, and to cough only between movements. 


Drofnats 
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Most of the macros in concert.tex have already been defined. Plain Tj?X 
takes care of things like \centerline and \bigskip, so only \composition, \composer, 
\movements, and \soloists remain to be specified: 

\def \composition#l{\halign{\bf\quad##\hf il\cr 

\kern-lem#l\crcr}} '/, use \cr’s if more than one line 
\def \composer#l{\rightline{\bf#l}} 

\def \movements#l{\halign{\quad\it##\hf il&&\qquad\it##\hf il\cr#l\crcr}} 

\def \soloists#l{\centerline{\bf \vbox{\halign{##\hf il\cr#l\crcr}}}} 

The \composition macro is set up to put. the title of the composition on two or 
more lines, if needed, but a single line usually suffices. Notice that \crcr has been 
used so that the final \cr in the argument to \composition is not needed. Similarly, 
\movements might be used to produce only a single line, and \soloists might be used 
when there is only one soloist. 

There’s also a \tsaologo macro. It applies only to one particular orchestra, 
but the definition is somewhat interesting nonetheless: 

\def \tsaologo{\vbox{\bigtype\bf 
\line{\hrulef ill} 

\kern- . 5\baselineskip 

\line{\hrulefill\phantom{ THE ST . \ , ANFORD ORCHESTRA }\hrulefill} 
\kern- . 5\baselineskip 

\line{\hrulefill\hbox{ THE ST . \ , ANFORD ORCHESTRA }\hrulefill} 
\kern- . 5\baselineskip 

\line{\hrulef ill\phantom{ R. J. Drofnats, Conductor }\hrulefill} 
\kern- . 5\baselineskip 

\line{\hrulef ill\hbox{ R. J. Drofnats, Conductor }\hrulefill} 

n 

The author has extended these macros to a more elaborate format that in- 
cludes special features for listing the members of the orchestra and for program notes, 
etc.; in this way it becomes fairly easy to typeset little booklets for concert patrons. 
Such extensions need not be discussed further in this appendix, because they don’t 
illustrate any essentially new ideas. 

Notice that the \composition and \movements and \soloists macros do 
not include any special provision for vertical spacing; the user is supposed to insert 
\smallskip, \medskip, and \bigskip as desired. This was done deliberately, because 
different concert programs seem to demand different spacing; no automatic scheme 
actually works very well in practice, since musical literature is so varied. 

Let’s turn now to the design of a format for an entire book, using this book 
itself as an example. How did the author prepare the computer file that generated The 
TfiXbook? We have already seen several hundred pages of output produced from that 
file; our goal in the remainder of this appendix will be to examine the input that was 
used behind the scenes. 

In the first place, the author prepared sample pages and showed them to the 
publisher’s book designer. (The importance of this step cannot be overemphasized. 
There is a danger that am hors — who are now able to typeset their own books with 
T}?X — will attempt to do their own designs, without professional help. Book design is 
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an art that requires considerable creativity, skill, experience, and taste; it is one of the 
most, important services that a publisher traditionally provides to an author. ) 

Sample pages that are used as the basis of a design should show each of the 
elements in the book. In this case the elements included chapter titles, illustrations, 
subchapter headings, footnotes, displayed formulas, typewriter type, dangerous bends, 
exercises, answers, quotations, tables, numbered lists, bulleted lists, etc.; the author 
also expressed a desire for generous margins, so that readers could make marginal notes. 

The designer, Herb Caswell, faced a difficult problem of bringing all those 
disparate elements into a consistent framework. He decided to achieve this by using a 
uniform indentation of 3 picas for normal paragraph openings as well as for dangerous 
bends; and to establish this element of the design by using it also for all the displayed 
material, instead of centering the displays. 

He decided to put the page numbers in bold type, out in the margins (where 
there was plenty of room, thanks to the author’s request for white space); and he 
decided to use italic type with caps and lower case for the running headlines, so that 
the pages would have a somewhat informal flavor. 

He chose 10-point type (on a 12-point base) for the main text, and 9-point 
type (on an 11-point, base) for the dangerous bends; the typeface was predetermined. 
He chose an \hsize of 29 picas and a \vsize of 44 picas. He decided to give subhead- 
ings like ‘►EXERCISE 13.8’ in boldface caps before the statement of each exercise. 
He specified the amount of vertical space before and after such things as exercises, 
dangerous-bend paragraphs, and displayed equations. He decided to devote an entire 
left-hand page to each chapter illustration. And so on; each decision influenced the 
others, so that the final book would appear to be as coherent and attractive as possible 
under the circumstances. After the main portion of the book was designed, he worked 
out. a format for the front, matter (i.e., the pages that, precede page 1); he arranged to 
have the same amount of “sinkage” (white space) at the top of each page there, so that 
the opening pages of the book would look unified and “open.” 

The author hasn’t actually followed the designer’s specifications in every de- 
tail. For example, nothing about stretching or shrinking of vertical spaces appeared in 
the design specs; the author introduced the notion of flexible glue on his own initiative, 
based on his observations of cut-and-paste operations often used in page makeup. If 
this book has any beauties, they should be ascribed to Herb Caswell; if it has any 
blemishes, they should be ascribed to Don Knut.li, who wrote the formatting macros 
that we are now about to discuss. 

The computer file manual.tex that generated The TEKbook begins with a 
copyright notice, and then it says ‘\input manmac’. The auxiliary file manmac.tex con- 
tains the formatting macros, and it. begins by loading 9-point., 8-point., and 6-point fonts: 


\f ont\ninerm=cmr9 
\f ont\ninei=cmmi9 
\f ont\ninesy=cmsy9 
\f ont\ninebf =cmbx9 
\f ont\ninett=cmtt9 
\f ont\nineit=cmti9 
\f ont\ninesl=cms!9 


\font\eightrm=cmr8 

\font\eighti=cmmi8 

\font\eightsy=cmsy8 

\font\eightbf =cmbx8 

\font\eighttt=cmtt8 

\font\eightit=cmti8 

\font\eightsl=cms!8 


\f ont\sixrm=cmr6 
\f ont\sixi=cmmi6 
\f ont\sixsy=cmsy6 
\f ont\sixbf=cmbx6 
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(These fonts had been \preloaded in Appendix B; now they’re officially loaded.) 
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The fonts intended for math formulas need to have a nonstandard \skewchar. 
The typewriter fonts are given \hyphenchar=-l so that hyphenation is inhibited when 
control sequence names and keywords appear in the text of a paragraph. 

\skewchar\ninei= ’ 177 \skewchar\eighti= ’ 177 \skewchar\sixi= ’ 177 
\skewchar\ninesy=’60 \skewchar\eightsy=’60 \skewchar\sixsy=’60 
\hyphenchar\ninett=- 1 \hyphenchar\eighttt=- 1 \hyphenchar\tentt=- 1 

A few more fonts are needed for special purposes: 


\f ont\tentex=cmtexlO 
\f ont\inchhigh=cminch 
\f ont\titlef ont=cmssdclO at 40pt 
\f ont\eightss=cmssq8 
\f ont\eightssi=cmssqi8 
\f ont\tenu=cmulO 
\f ont\manual=manfnt 
\f ont\magnif iedf iverm=cmr5 at lOpt 


'/, TeX character set as in Appendix C 
'/, inch-high caps for chapter openings 
titles in chapter openings 
'/, quotations in chapter closings 
ditto , slanted 
'/, unslanted text italic 
'/, METAFONT logo and special symbols 
'/, to demonstrate magnification 


Now we come to the size-switching macros, which are much more elaborate 
than they were in the previous example because mathematics needs to be supported in 
three different sizes. The format also provides for a pseudo “small caps” (\sc); a true 
caps-and-small-caps font was not really necessary in the few cases that \sc was used. 
A dimension variable \ttglue is set equal to the desired spacing for the typewriter-like 
text that occasionally appears in paragraphs; the \tt fonts have fixed spacing, which 
doesn’t mix well with variable spacing, hence the macros below use \ttglue between 
words in appropriate places. 


\catcode ‘0=11 '/, we will access private macros of plain TeX (carefully) 
\newskip\ttglue 

\def \tenpoint{\def \rm{\f amO\tenrm}'/, switch to 10-point type 

\textf ont0=\tenrm \scriptfontO=\sevenrm \scriptscriptf ont0=\f iverm 
\textf ontl=\teni \scriptfontl=\seveni \scriptscriptf ontl=\f ivei 
\textf ont2=\tensy \scriptfont2=\sevensy \scriptscriptf ont2=\f ivesy 
\textf ont3=\tenex \scriptfont3=\tenex \scriptscriptf ont3=\tenex 

\textf ont\itfam=\tenit \def \it{\f am\itf am\tenit}'/, 

\textf ont\slfam=\tensl \def \sl{\f am\slf am\tensl}*/, 

\textf ont\ttfam=\tentt \def \tt{\f am\ttf am\tentt}'/, 

\textf ont\bffam=\tenbf \scriptf ont\bffam=\sevenbf 
\scriptscriptf ont\bffam=\f ivebf \def \bf{\f am\bff am\tenbf }*/, 

\tt \ttglue=.5em plus.25em minus. 15em 
\normalbaselineskip=12pt 

\setbox\strutbox=\hbox{\vrule height8.5pt depth3.5pt widthOpt}'/, 
\let\sc=\eightrm \let\big=\tenbig \normalbaselines\rm} 

\def \ninepoint{\def\rm{\famO\ninerm}*/, switch to 9-point type 

\textf ontO=\ninerm \scriptf ont0=\sixrm \scriptscriptf ont0=\f iverm 
\textf ontl=\ninei \scriptf ontl=\sixi \scriptscriptf ontl=\f ivei 
\textf ont2=\ninesy \scriptf ont2=\sixsy \scriptscriptf ont2=\f ivesy 
\textf ont3=\tenex \scriptf ont3=\tenex \scriptscriptf ont3=\tenex 
\textf ont\itfam=\nineit \def\it{\fam\itfam\nineit}'/, 
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\textf ont\slfam=\ninesl \def\sl{\fam\slfam\ninesl}'/, 

\textf ont\ttfam=\ninett \def\tt{\fam\ttfam\ninett}*/, 

\textf ont\bffam=\ninebf \scriptfont\bff am=\sixbf 
\scriptscriptf ont\bffam=\f ivebf \def \bf{\f am\bff am\ninebf}'/, 

\tt \ttglue=.5em plus.25em minus. 15em 
\normalbaselineskip=llpt 

\setbox\strutbox=\hbox{\vrule height8pt depth3pt widthOpt}'/, 
\let\sc=\sevenrm \let\big=\ninebig \normalbaselines\rm} 

\def \eightpoint{\def \rm{\f amO\eightrm}'/, switch to 8-point type 

\textf ontO=\eightrm \scriptfontO=\sixrm \scriptscriptf ontO=\f iverm 
\textf ontl=\eighti \scriptfontl=\sixi \scriptscriptf ontl=\f ivei 
\textf ont2=\eightsy \scriptfont2=\sixsy \scriptscriptf ont2=\f ivesy 
\textf ont3=\tenex \scriptfont3=\tenex \scriptscriptf ont3=\tenex 

\textf ont\itfam=\eightit \def \it{\f am\itf am\eightit}'/, 

\textf ont\slfam=\eightsl \def \sl{\f am\slf am\eightsl}'/, 

\textf ont\ttfam=\eighttt \def \tt{\f am\ttf am\eighttt}*/, 

\textf ont\bffam=\eightbf \scriptf ont\bffam=\sixbf 
\scriptscriptf ont\bffam=\f ivebf \def \bf{\f am\bff am\eightbf }'/, 

\tt \ttglue=.5em plus.25em minus. 15em 
\normalbaselineskip=9pt 

\setbox\strutbox=\hbox{\vrule height7pt depth2pt widthOpt}'/, 
\let\sc=\sixrm \let\big=\eightbig \normalbaselines\rm} 

\def \tenbig#l{{\hbox{$\lef t#l\vbox to8 . 5pt{}\right . \n@space$}}} 

\def \ninebig#l{{\hbox{$\textf ontO=\tenrm\textf ont2=\tensy 
\left#l\vbox to7. 25pt{}\right . \n@space$}}} 

\def \eightbig#l{{\hbox{$\textfont0=\ninerm\textfont2=\ninesy 
\left#l\vbox to6. 5pt{}\right . \n@space$}}} 

\def \tenmath{\tenpoint\f am-1 } '/, for 10-point math in 9-point territory 

Issues of page layout are dealt with next. First, the basics: 

\newdimen\pagewidth \newdimen\pageheight \newdimen\ruleht 
\hsize=29pc \vsize=44pc \maxdepth=2 . 2pt \parindent=3pc 
\pagewidth=\hsize \pageheight=\vsize \ruleht=.5pt 
\abovedisplayskip=6pt plus 3pt minus lpt 
\belowdisplayskip=6pt plus 3pt minus lpt 
\abovedisplayshortskip=Opt plus 3pt 
\belowdisplayshortskip=4pt plus 3pt 

(The curious value of \maxdepth was chosen only to provide an example in Chapter 15; 
there’s no deep reason behind it.) 

When the author prepared this book, he made notes about what things ought 
to go into the index from each page. These notes were shown in small type on his 
proofsheets, like the words ‘marginal hacks’ in the right margin of this page. The 
manmac format uses an insertion class called \margin to handle such notes. 

\newinsert\margin 

\dimen\margin=\maxdimen '/, no limit on the number of marginal notes 
\count\margin=0 \skip\margin=Opt '/, marginal inserts take up no space 
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The \footnote macro of plain TgX needs to be changed because footnotes 
are indented and set in 8-point type. Some simplifications have also been made, since 
footnotes are used so infrequently in this book. 

\def \f ootnote#l{\edef\@sf{\spacef actor\the\spacef actor}#l\@sf 
\ insert \f ootins\bgroup\eightpoint 
\interlinepenaltylOO \let\par=\endgraf 
\leftskip=Opt \rightskip=Opt 

\splittopskip=10pt plus lpt minus lpt \f loatingpenalty=20000 
\smallskip\item{#l}\bgroup\strut\aftergroup\@f oot\let\next} 
\skip\footins=12pt plus 2pt minus 4pt '/, space added when footnote exists 
\dimen\f ootins=30pc '/, maximum footnotes per page 

The text of running headlines will be kept in a control sequence called \rhead. 
Some pages should not have headlines; the \titlepage macro suppresses the headline 
on the next page that is output. 

\newif\if title 

\def \titlepage{\global\titletrue} '/, for pages without headlines 
\def\rhead-Q '/, \rhead contains the running headline 
\def \leftheadline{\hbox to \pagewidth{'/, 

\vbox to lOptO*/. strut to position the baseline 
\llap{\tenbf\folio\kernlpc}'/, folio to left of text 
\tenit\rhead\hf il}} '/, running head flush left 
\def \rightheadline{\hbox to \pagewidth{\vbox to 10pt{}'/, 

\hf il\tenit\rhead\/'/, running head flush right 
\rlap{\kernlpc\tenbf\folio}}} '/, folio to right of text 

Pages are shipped to the output by the \onepageout macro, which attaches 
headlines, marginal notes, and/or footnotes, as appropriate. Special registration marks 
are typeset at the top of title pages, so that the pages will line up properly on printing 
plates that are made photographically from TfgXs “camera-ready” output. A small 
page number is also printed next to the corner markings; such auxiliary information 
will, of course, be erased before the pages are actually printed. 

\def \onepageout#l{\shipout\vbox{ ’/, here we define one page of output 
\off interlineskip '/, butt the boxes together 
\vbox to 3pc{ '/, this part goes on top of the 44pc pages 
\iftitle \global\titlef alse \setcornerrules 

\else\ifodd\pageno\rightheadline\else\leftheadline\f i\f i \vf ill} 
\vbox to \pageheight{ 

\ifvoid\margin\else '/, marginal info is present 
\rlap{\kern31pc\vbox to0pt{\kern4pt\box\margin \vss}}\fi 
#1 */, now insert the main information 
\ifvoid\f ootins\else '/, footnote info is present 
\vskip\skip\f ootins \kern-3pt 

\hrule height\ruleht width\pagewidth \kern-\ruleht \kern3pt 
\unvbox\f ootins\f i 
\boxmaxdepth=\maxdepth}} 

\advancepageno} 
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\def \setcornerrules{\hbox to \pagewidth{'/, for camera alignment 
\vrule width lpc height \ruleht \hfil \vrule width lpc} 

\hbox to \pagewidth{\llap{\sevenrm(page \folio)\kernlpc}'/, 

\vrule heightlpc width\ruleht depthOpt 
\hfil \vrule width\ruleht depthOpt}} 

\output{\onepageout{\unvbox255}} 

A different output routine is needed for Appendix I (the index), because most 
of that appendix appears in two-column format. Instead of handling double columns 
with an ‘\lr’ switch, as discussed in Chapter 23, manmac does the job with \vsplit, 
after collecting more than enough material to fill a page. This approach makes it 
comparatively easy to balance the columns on the last page of the index. A more 
difficult approach would be necessary if the index contained insertions (e.g., footnotes); 
fortunately, it doesn’t. Furthermore, there is no need to use \mark as suggested in the 
index example of Chapter 23, since the entries in Appendix I tend to be quite short. 
The only real complication that manmac faces is the fact that Appendix I begins and 
ends with single-column format; partial pages need to be juggled carefully as the format 
changes back and forth. 

\newbox\part ialpage 

\def \begindoublecolumns{\begingroup 

\output={\global\setbox\partialpage=\vbox{\unvbox255\bigskip}}\eject 
\output={\doublecolumnout} \hsize=14pc \vsize=89pc} 

\def \enddoublecolumns{\output={\balancecolumns}\e ject 
\endgroup \pagegoal=\vsize} 

\def \doublecolumnout{\splittopskip=\topskip \splitmaxdepth=\maxdepth 
\dimen@=44pc \advance\dimen@ by-\ht\partialpage 
\setbox0=\vsplit255 to\dimen@ \setbox2=\vsplit255 to\dimen@ 
\onepageout\pagesofar \unvbox255 \penalty\outputpenalty} 

\def \pagesof ar{\unvbox\part ialpage 

\wdO=\hsize \wd2=\hsize \hbox to\pagewidth{\boxO\hf il\box2}} 

\def \balancecolumns{\setbox0=\vbox{\unvbox255} \dimen@=\htO 
\advance\dimen@ by\topskip \advance\dimen@ by-\baselineskip 
\divide\dimen@ by2 \splittopskip=\topskip 
{\vbadness=10000 \loop \global\setbox3=\copy0 
\global\setboxl=\vsplit3 to\dimen<§ 

\ifdim\ht3>\dimen@ \global\advance\dimen@ bylpt \repeat} 

\setboxO=\vbox to\dimen@{\unvboxl} \setbox2=\vbox to\dimen@{\unvbox3} 
\pagesof ar} 

The balancing act sets \vbadness infinite while it is searching for a suitable column 
height, so that underfull vboxes won’t be reported unless the actual columns are bad 
after balancing. The columns in Appendix I have a lot of stretchability, since there’s a 
\parskip of Opt plus . 8pt between adjacent entries, and since there is room for more 
than 50 lines per column; therefore the manmac balancing routine tries to make both 
the top and bottom baselines agree at the end of the index. In applications where the 
glue is not so flexible it would be more appropriate to let the right-hand column be a 
little short; the best way to do this is probably to replace the command ‘\unvbox3’ by 
‘\dimen2=\dp3 \unvbox3 \kern-\dimen2 \vfil’. 
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The next macros are concerned with chapter formatting. Each chapter in the 
manuscript file starts out with the macro \beginchapter; it ends with \endchapter 
and two quotations, followed by \eject. For example, Chapter 15 was generated by 
TJrX commands that look like this in the file manual.tex: 

\beginchapter Chapter 15. How \TeX\ MakesWLines into Pages 

\TeX\ attempts to choose desirable places to divide your document into 
(about 1100 lines of the manuscript are omitted here) 
break. \ (Deep breath.) \ You got that? 

\endchapter 

Since it is impossible to foresee how [footnotes] will happen to come out 
in the make-up, it is impracticable to number them from 1 up on each page. 
The best way is to number them consecutively throughout an article 
or by chapters in a book. 

\ author UNIVERSITY OF CHICAGO PRESS, {\sl Manual of Style\/} (1910) 
\bigskip 

Don’t use footnotes in your books, Don. 

\ author JILL KNUTH (1962) 

\e ject 

The ‘\V in the title line specifies a line break to be used on the left-hand title page 
that faces the beginning of the chapter. Most of the \beginchapter macro is devoted 
to preparing that title page; the \TeX logo needs somewhat different spacing when it 
is typeset in \titlefont, and the \inchhigh digits need to be brought closer together 
in order to look right in a title. 

\newcount\exno '/, for the number of exercises in the current chapter 
\newcount\subsecno '/, for the number of subsections in the current chapter 

\outer\def \beginchapter#l #2#3. #4\par{\def \chapno{#2#3} 

\global\exno=0 \subsecno=0 
\ifodd\pageno 

\errmessage{You had too much text on that last page; I’m backing up} 
\advance\pageno by-1 \fi 

\def\\{ } '/, \\’s in the title will be treated as spaces 
\message{#l #2#3:} '/, show the chapter title on the terminal 
\xdef\rhead{#l #2#3: #4\unskip} '/, establish a new running headline 
{\def \TeX{T\kern- . 2em\lower . 5ex\hbox{E}\kern- . 06em X} 

\def\\{#3} 

\ifx\empty\\ \rightline{\inchhigh #2\kern- . 04em} 
\else\rightline{\inchhigh #2\kern- . 06em#3\kern- . 04em}\f i 
\vskipl.75pc \baselineskip=36pt \lineskiplimit=lpt \lineskip=12pt 
\let\\=\cr '/, now the \\’s are line dividers 
\halign{\line{\titlef ont\hf il##}\\#4\unskip\\} 

\titlepage\vf ill\eject} */, output the chapter title page 
\tenpoint\noindent\ignorespaces} the first paragraph is not indented 
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An extra page is ejected at the end of a chapter, if necessary, so that the 
closing quotations will occur on a right-hand page. (The logic for doing this is not 
perfect, but it doesn’t need to be, because it fails only when the chapter has to be 
shortened or lengthened anyway; book preparation with TpX . as with type, encourages 
interaction between humans and machines. ) The lines of the quotations are set flush 
right by using \obeylines together with a stretchable \leftskip: 

\outer\def \endchapter{\ifodd\pageno \else\vf ill\eject\null\f i 
\begingroup\bigskip\vf ill '/, beginning of the quotes 
\def \e ject{\endgroup\e ject} / ending of the quotes 
\def \par{\ if hmode\/\endgraf \fi}\obey lines 
\def \TeX{T\kern- . 2em\lower . 5ex\hbox{E}X} 

\eightpoint \let\tt=\ninett \baselineskip=10pt \interlinepenalty=10000 
\leftskip=Opt plus 40pc minus \parindent \parf illskip=Opt 
\let\rm=\eightss \let\sl=\eightssi \everypar{\sl}} 

\def \author#l (#2){\smallskip\noindent\rm #l\unskip\enspace (#2)} 

We come now to what goes on inside the chapters themselves. Dangerous and 
doubly dangerous bends are specified by typing ‘\danger’ or ‘\ddanger’ just before a 
paragraph that is supposed to display a warning symbol: 

\def \dbend{{\manual\charl27}} '/, "dangerous bend" sign 
\def \d@nger{\medbreak\begingroup\clubpenalty =10000 

\def \par{\endgraf\endgroup\medbreak} \noindent\hang\hangafter=-2 
\hbox toOpt{\hskip-\hangindent\dbend\hf ill}\ninepoint} 

\outer\def \danger{\d@nger} 

\def \dd@nger{\medbreak\begingroup\clubpenalty= 10000 

\def \par{\endgraf\endgroup\medbreak} \noindent\hang\hangafter=-2 
\hbox toOpt{\hskip-\hangindent\dbend\kernlpt\dbend\hf ill}\ninepoint} 
\outer\def \ddanger{\dd@nger} 

\def \enddanger{\endgraf\endgroup} '/, omits the \medbreak 

(It’s necessary to type ‘\enddanger’ at the end of a dangerous bend only in rare cases 
that a medium space is not desired after the paragraph; e.g., ‘\smallskip\item’ might 
be used to give an itemized list within the scope of the dangerous bend sign.) 

A few chapters and appendices of this book (e.g., Chapter 18 and Appendix B) 
are divided into numbered subsections. Such subsections are specified in the manuscript 
by typing, for example, 

\subsection Allocation of registers. 

Appendix A is subdivided in another way, by paragraphs that have answer numbers: 

\outer\def \subsection#l . {\medbreak\advance\subsecno by 1 
\noindent{\it \the\subsecno . \enspace#l . \enspace}} 

\def \ansno#l . #2 : {\medbreak\no indent 

\hbox to\parindent{\bf \hss#l . #2 . \enspace}\ignorespaces} 

We will see below that the manuscript doesn’t actually specify an \ansno directly; each 
call of \ansno is generated automatically by the \answer macro. 

Appendix H points out The T^Xbook calls for three hyphenation exceptions: 

\hyphenation{man-u-script man-u-scripts ap-pen-dix} 
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A few macros in manmac provide special constructions that are occasionally 
needed in paragraphs: \MF for ‘METRFONT’, \AmSTeX for ‘■4^Vf<S-T}gX’, \bull for 
\dn and \up for ‘4-’ and ‘t’, \ I and \] for 1 1 1 and ‘ u ’. To typeset 

3pt of (stuff), '105 = 69, "69 = 105, |wow| 

the manuscript says 

$3\pt$ of \<stuff>, $\oct{105}=69$ , $\hex{69}=105$ , \cstok{wow} 
using the macros \pt, \<, \oct, \hex, and \cstok. 

\def \MF{{\manual META}\-{\manual FOHT}} 

\def \AmSTeX{$\cal A\kern-. 1667em\lower . 5ex\hbox{$\cal M$}\kern- . 075em 
S$-\TeX} 

\def \bull{\vrule height . 9ex width . 8ex depth -.lex } '/, square bullet 
\def\SS{{\it SS}} '/, scriptscript style 

\def \dn{\leavevmode\hbox{\tt\char ’ 14}} '/, downward arrow 
\def \up{\leavevmode\hbox{\tt\char ’ 13}} '/, upward arrow 
\def \ | {\leavevmode\hbox{\tt\char ‘ \ I }} / vertical line 
\def \] {\leavevmode\hbox{\tt\char ‘ \ }} '/, visible space 

\def \pt{\ , {\rm pt}} */, units of points, in math formulas 

\def \<#l>{\leavevmode\hbox{$\langle$#l\/$\rangle$}} '/, syntactic quantity 
\def \oct#l{\hbox{\rm\ ’ {}\kern- . 2em\it#l\/\kern. 05em}} '/, octal constant 
\def \hex#l{\hbox{\rm\H{}\tt#l}} '/, hexadecimal constant 

\def \cstok#l{\leavevmode\thinspace\hbox{\vrule\vtop{\vbox{\hrule\kernlpt 
\hbox{\vphantom{\tt/}\thinspace{\tt#l}\thinspace}} 
\kernlpt\hrule}\vrule}\thinspace} */, control sequence token 

Displays in this book are usually indented rather than centered, and they usu- 
ally involve text rather than mathematics. The manmac format makes such displays 
convenient by introducing two macros called \begindisplay and \enddisplay; there’s 
also a pair of macros \begintt and \endtt for displays that are entirely in typewriter 
type. The latter displays are copied verbatim from the manuscript file, without inter- 
preting symbols like \ or $ in any special way. For example, part of the paragraph 
above was typed as follows: 

... To typeset 
\begindisplay 

$3\pt$ of \<stuff>, $\oct{105}=69$ , $\hex{69}=105$ , \cstok{wow} 

\enddisplay 

the manuscript says 

\begintt 

$3\pt$ of \<stuff>, $\oct{105}=69$ , $\hex{69}=105$ , \cstok{wow} 
\endtt 

using the macros I \pt I , I \< I , |\oct|, |\hex|, and |\cstok|. 

(The last line of this example illustrates the fact that verbatim typewriter text can be 
obtained within a paragraph by using vertical lines as brackets.) The \begindisplay 
macro is actually more general than you might expect from this example; it allows 
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multiline displays, with \cr following each line, and it also allows local definitions 
(which apply only within the display) to be specified immediately after \begindisplay. 

\outer\def \begindisplay{\obey line s\startdi splay} 

{\obeylines\gdef \startdi splay #1 

{\catcode ‘ \“M=5$$#l\halign\bgroup\indent##\hf il&&\qquad##\hf il\cr}} 
\outer\def \enddisplay{\crcr\egroup$$} 

\chardef \other=12 

\def \ttverbatim{\begingroup \catcode ‘\\=\other \catcode ‘\{=\other 
\catcode ‘\}=\other \catcode ‘\$=\other \catcode ‘ \&=\other 
\catcode ‘\#=\other \catcode ‘ V/.=\other \catcode ‘ \~=\other 
\catcode ‘\_=\other \catcode ‘\~=\other 
\obeyspaces \obeylines \tt} 

{\obeyspaces\gdef {\ }} '/, \obeyspaces now gives \ , not \space 

\outer\def \begintt{$$\let\par=\endgraf \ttverbatim \parskip=Opt 
\catcode ‘\ I =0 \rightskip=-5pc \ttfinish} 

{\catcode ‘ \ I =0 I catcode ‘ I \=\other '/, I is temporary escape character 
lobeylines '/, end of line is active 

I gdef | ttf inish#l“"M#2\endtt{#l I vbox{#2} I endgroup$$}} 

\catcode ‘\ I =\active 

{\obeylines\gdef I {\ttverbatim\spaceskip=\ttglue\let""M=\ \let I =\endgroup}} 

These macros are more subtle than the others in this appendix, and they deserve 
careful study because they illustrate how to disable l^jX’s normal formatting. The 1 1 ’ 
character is normally active (category 13) in manmac format, and its appearance causes 
the \ttverbatim macro to make all of the other unusual characters into normal symbols 
(category 12). However, within the scope of \begintt. . . \endtt a vertical line is an 
escape character (category 0); this permits an escape out of verbatim mode. 

The \begintt macro assumes that a comparatively small amount of text will 
be displayed; the verbatim lines are put into a vbox, so that they cannot be broken 
between pages. A different approach lias been used for most of the typewriter copy 
in this appendix and in Appendix B: Material that is quoted from a format file is 
delimited by \beginlines and \endlines, between which it is possible to give com- 
mands like ‘\smallbreak 1 to help with spacing and page breaking. The \beginlines 
and \endlines macros also insert rules, fore and aft: 

\def \beginlines{\par\begingroup\nobreak\medskip\par indent =0pt 
\hrule\kernlpt\nobreak \obeylines \everypar{\strut}} 
\def\endlines{\kernlpt\hrule\endgroup\medbreak\no indent} 

For example, the previous three lines were typeset by the specification 

\beginlines 

I \def \beginlines{\par\begingroup\nobreak\medskip\par indent =0pt I 
\nobreak 

I \hrule\kernlpt\nobreak \obeylines \everypar{\strut}} I 
I \def\endlines{\kernlpt\hrule\endgroup\medbreak\no indent} I 
\endlines 
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A strut is placed in each line so that the rules will be positioned properly. The manmac 
format also has macros \beginmathdemo . . . \endmathdemo that were used to produce ex- 
amples of mathematics in Chapters 16-19, \beginsyntax . . Aendsyntax for the formal 
syntax in Chapters 24-26, \beginchart . . . \endchart for the font tables in Appendices 
C and F, etc.; those macros are comparatively simple and they need not be shown here. 

Exercises are specified by an \exercise macro; for example, the first exercise 
in Chapter 1 was generated by the following lines in the manuscript: 

\exercise After you have mastered the material in this book, 
what will you be: A \TeX pert, or a \TeX nician? 

\answer A \TeX nician (underpaid) ; sometimes also called 
a \TeX acker. 

Notice that the \answer is given immediately after each exercise; that makes it easy 
to insert new exercises or to delete old ones, without keeping track of exercise num- 
bers. Exercises that are dangerous or doubly dangerous are introduced by the macros 
\dangerexercise and \ddangerexercise. 

\outer\def \exercise{\medbreak \global\advance\exno by 1 

\noindent\llap{\manual\char ’ 170\rm\kern. 15em}'/, triangle in margin 
{\ninebf EXERCISE \bf \chapno . \the\exno}\par\nobreak\noindent} 

\def \dexercise{\global\advance\exno by 1 

\llap{\manual\char ’ 170\rm\kern. 15em}'/, triangle in indented space 
{\eightbf EXERCISE \bf\chapno . \the\exno}\hf il\break} 

\outer\def \dangerexercise{\d@nger \dexercise} 

\outer\def \ddangerexercise{\dd@nger \dexercise} 

(The last two lines use \d@nger and \dd@nger, which are non-\outer equivalents of 
\danger and \ddanger; such duplication is necessary because control sequences of type 
\outer cannot appear within a \def.) 

The \answer macro copies an answer into a file called answers.tex; then 
Appendix A reads this file by saying ‘\immediate\closeout\ans \ninepoint \input 
answers’. Each individual answer ends with a blank line; thus, \par must be used 
between the paragraphs of a long answer. 

\newwrite\ans 

\immediate\openout\ans=answers */, file for answers to exercises 
\outer\def \answer{\par\medbreak 
\immediate\write\ans{} 

\immediate\write\ans{\string\ansno\chapno . \the\exno : } 

\copytoblankline} 

\def \copytoblankline{\begingroup\setupcopy\copyans} 

\def \setupcopy{\def\do##l{\catcode ‘##l=\other}\dospecials 
\catcode A I =\other \obeylines} 

{\obeylines \gdef \copyans#l 
{\def\next{#l}'/, 

\ifx\next\empty\let\next=\endgroup */, 

\else\immediate\write\ans{\next} \let\next=\copyans\f i\next}} 


strut 

llap 

outer 

immediate 

closeout 

input 

newwrite 

openout 

write 

the 



Appendix E: Example Formats 423 


Notice the use of \dospecials here, to set up the verbatim copying. The \ttverbatim 
macro could have invoked \dospecials in the same way; but \ttverbatim is used quite 
frequently, so it was streamlined for speed. 

The remaining macros in manmac format are designed to help in producing 
a good index. When a paragraph contains a word or group of words that deserve to 
be indexed, the manuscript indicates this by inserting "{...}; for example, the first 
sentence of the present paragraph actually ends with ‘a good "{index}’. This caused 
an appropriate entry to be written onto a file index.tex when TjgX was typesetting 
the page; it also put the word ‘index’ into the margin of the proofsheets, so that the 
author could remember what had been marked for indexing without looking into the 
manuscript file. Indexing with the "{...} notation doesn’t change IpX's behavior in 
any essential way; thus, the word ‘index’ appears in the text as well as in the index. 

\newwrite\inx 

\iramediate\openout\inx=index / file for index reminders 

\def \marginstyle{\sevenrm */, marginal notes are in 7-point type 

\vrule height6pt depth2pt widthOpt } */, a strut for \insert\margin 

Sometimes it is desirable to index words that don’t actually appear on the 
page; the notation ""{. . .} stands for a “silent” index entry, and spaces are ignored 
after the closing “}’ in such a case. For example, Appendix I lists page 1 under ‘beauty’, 
even though page 1 contains only the word ‘beautiful’; the manuscript achieves this 
by saying ‘beautiful ""{beauty}’. (The author felt that it was important to index 
‘beauty’ because he had already indexed ‘truth’.) 

It’s not difficult to make " into an active character that produces such index 
entries, while still retaining its use for superscripts in math formulas, because \ifmmode 
can be used to test whether a control sequence is being used in math mode. However, 
manmac ’s use of " as an active character means that ~~M cannot be used to refer to a 
(return) character. Fortunately the **M notation isn’t needed except when the format- 
ting macros themselves are being defined. 

The following macros set things up so that " and "" are respectively converted 
to \silentf alse\xref and \silenttrue\xref , outside of math mode: 

\newif\if silent 

\def \specialhat{\ifmmode\def \next{"}\else\let\next=\beginxref\f i\next} 

\def \beginxref {\futurelet\next\beginxref switch} 

\def \beginxref switch{\ifx\next\specialhat\let\next=\silentxref 
\else\silentfalse\let\next=\xref \f i \next} 

\catcode ‘\"=\active \let "=\specialhat 
\def \silentxref "{\silenttrue\xref } 

Entries in the index aren’t always words in roman type; they might require 
special typesetting conventions. For example, there are hundreds of items in Appen- 
dix I that are preceded by a backslash and set in typewriter type. The manmac for- 
mat makes it easy to produce such entries by typing, e.g., ‘“ I Mmmediate I’ instead 
of ‘"{ I Mmmediate I }’. In this case the backslash is not written onto the index file, 
because it would interfere with alphabetization of the entries; a code number is written 
out so that the backslash can be reinstated after the index has been sorted. The code 
number also is used to put the entry in typewriter type. 


dospecials 

verbatim copying 

efficiency 

index 

strut 

newwrite 

immediate 

openout 

ifmmode 

hat as an active character 

hat hat 

newif 

futurelet 



424 Appendix E: Example Formats 


The indexing macros of manmac produce entries of four kinds, which are as- 
signed to codes 0, 1, 2, and 3. Code 0 applies when the argument is enclosed in braces, 
e.g., ‘“{word}’ ; code 1 applies when the argument is enclosed in vertical lines and there’s 
no backslash, e.g., “I plus I’; code 2 is similar but with a backslash, e.g., ‘~|\par|’; 
code 3 applies when the argument is enclosed in angle brackets, e.g., ‘~\<stuff>’. The 
four example entries in the previous sentence will be written on file index.tex in the 
form 

word ! 0 123. 
plus ! 1 123. 
par ! 2 123. 
stuff ! 3 123. 

if they appear on page 123 of the book. 

\chardef\bslash=‘\\ '/. \bslash makes a backslash (in tt fonts) 
\def\xref{\futurelet\next\xref switch} '/, branch on the next character 
\def \xref switch{\begingroup\ifx\next I \aftergroup\vxref 
\else\ifx\next\<\aftergroup\anglexref 

\else\aftergroup\normalxref \fi\fi \endgroup} 

\def \vxref I {\catcode ‘ \\=\active \futurelet\next\vxref switch} 

\def \vxref switch#l I {\catcode ‘\\=0 
\ifx\next\empty\def\xreftype{2}'/, 

\def\next{{\tt\bslash\text}}*/, code 2, |\arg| 
\else\def\xreftype{l}\def\next{{\tt\text}}\f i '/, code 1, largl 
\edef\text{#l}\makexref} 

{\catcode ‘ \ I =0 \catcode ‘\\=\active lgdef\{}} 

\def \anglexref \<#l>{\def \xreftype{3}\def\text{#l}'/, 

\def \next{\<\text>}\makexref} '/, code 3, \<arg> 

\def \normalxref#l{\def \xreftype{0}\def \text{#l}\let\next=\text\makexref} 

Indexing is suppressed unless the proofmode switch is set to true, since mate- 
rial is gathered for the index only during trial runs — not on the triumphant occasion 
when the book is finally being printed. 

\newif\if proof mode 

\proofmodetrue '/, this should be false when making camera-ready copy 
\def \makexref {\if proof mode\insert\margin{\hbox{\marginstyle\text}}'/, 

\xdef \writeit{\write\inx{\text\space ! \xreftype\space 
\noexpand\number\pageno . }}\writeit 
\else\ifhmode\kernOpt \fi\fi 
\if silent \ ignore spaces\else\next\f i} 

(The \ insert suppresses hyphenation when proofs are being checked; a \kern0pt is 
therefore emitted to provide consistent behavior in the other case. ) 

The material that accumulates on file index.tex gives a good first approxi- 
mation to an index, but it doesn’t contain enough information to do the whole job; a 
topic often occurs on several pages, but only the first of those pages is typically listed 
in the file. The author prefers not to generate indexes automatically; he likes to reread 
his books as he checks the cross-references, thereby having the opportunity to rethink 
everything and to catch miscellaneous errors before it is too late. As a result, his books 


catcode 

xdef 

write 

noexpand 

number 

ignorespaces 

insert 

hyphenation 

kern 

Knuth 
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tend to be delayed, but the indexes tend to be pretty good. Therefore he designed 
the indexing scheme of manmac to provide only the clues needed to make a real index. 
On the other hand, it would be possible to extend the macros above and to obtain 
a comprehensive system that generates an excellent index with no subsequent human 
intervention; see, for example, “An indexing facility for 1)?X” by Terry Winograd and 
Bill Paxton, in TUGboat 1 (1980), A1-A12. 

The manmac macros have now been fully presented; we shall close this appendix 
by presenting one more example of their use. Chapter 27 mentions the desirability of 
creating a long book in small parts, by using a “galley” file. The author adopted that 
strategy for The TpXhook, entering each chapter into a small file galley.tex that 
looked like this: 

\input manmac 

\tenpoint 

\pageno=800 

\def \rhead{Experimental Pages for The \TeX book} 

\def\chapno{ X} 

{\catcode ‘ V/,=12 \immediate\write\ans{'/, Answers for galley proofs:}} 


Winograd 

Paxton 

TUGboat 

galley 

BROWN 

epigraph 

Knuth 

Crocker 

VAN LEUNEN 


(new text being tested, usually an entire chapter) 


that blank line will stop an unfinished \answer 
\immediate\closeout\ans 
\vf ill\e ject 

\ninepoint \input answers '/, typeset the new answers, if any 
\bye 


It is much easier to use macros than to define them. 

The use of macro libraries, in fact, mirrors almost exactly 
the use of subroutine libraries for programming languages. 

There are the same levels of specialization, 
from publicly shared subroutines 
to special subroutines within a single program, 
and there is the same need for a programmer 
with particular skills to define the subroutines. 

— PETER BROWN, Macro Processors (1974) 

The epigraph is among the most delightful of scholarly habits. 
Donald Knuth 's work on fundamental algorithms would be 
just as important if he hadn't begun with a quotation 
from Betty Crocker, but not so enjoyable. 

Part of the fun of an epigraph is turning a source to an unexpected use. 

— MARY-CLAIRE VAN LEUNEN, A Handbook for Scholars (1978) 
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The purpose of this appendix is to summarize the chief characteristics of the 
Computer Modern typefaces. TgX is able to typeset documents with any fonts, 
having any arrangement of characters; the fonts and layouts to be described here 
are the particular ones that correspond to plain TgX format, i.e. , to the macros 
in Appendix B. (Complete information about the Computer Modern family, 
including the METRFONT programs that draw the characters, can be found in 
the author’s book Computer Modern Typefaces.) 

The first pages of this appendix show what the fonts contain; the last 
pages show what the symbols are called when they’re used in math formulas. 
(See Appendix B for the conventions that apply in non-mathematical text.) 

There are exactly 128 different characters in each of the Computer Mod- 
ern fonts, although Tf^X can work up to 256 characters per font. The text fonts 
are laid out as shown in the table below, which illustrates font cmrlO (Computer 
Modern Roman 10 point). Thus, for example, if you ask for \char’35 when 
cmrlO is the current font, you get the symbol M. These text fonts include the 
ligatures and accents described in Chapter 9; each symbol that happens to be 
a visible ASCII character appears in its ASCII position. Some of the ASCII 
symbols (namely " < > \ _ { I }) are not included because they don’t occur 
in normal printer’s fonts. If you mistakenly type ", you get ”; and < outside of 
math mode yields j ! Incidentally, the ten digits all have width 0 . 5em. 


Figure 1. Text font layout, showing cmrlO (\rm, \textfontO). 
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Plain TgX makes use of sixteen basic fonts: 


cmrlO 

cmr7 

cmr5 

cmbxlO 

cmbx7 

cmbx5 

cmsllO 

cmtilO 

cmttlO 

cmmilO 

cmmi7 

cmmi5 

cmsylO 

cmsy7 

cmsy5 

cmexlO 


(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 
(Computer Modern 


Roman 10 point) 

Roman 7 point) 

Roman 5 point) 

Bold Extended 10 point) > 
Bold Extended 7 point) 
Bold Extended 5 point) 
Slanted Roman 10 point) 
Text Italic 10 point) 
Typewriter Type 10 point) 
Math Italic 10 point) 

Math Italic 7 point) 

Math Italic 5 point) 

Math Symbols 10 point) 
Math Symbols 7 point) 
Math Symbols 5 point) 
Math Extension 10 point) 


text 


> special 


The first eight of these all have essentially the same layout; but cmr5 needs no 
ligatures, and many of the symbols of cmtilO have different shapes. For example, 
the ampersand becomes an ‘E.T.’, and the dollar changes to pound sterling: 


ampersand 

E.T. 

dollar 

pound 

sterling 

it 


Figure 2. Text font layout, showing cmtilO (\it). 
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The typewriter font cmttlO is almost like the fonts for ordinary text, but it 
includes all of the visible ASCII characters, in their correct positions. It also has 
vertical arrows t and 4, as well as an undirected single quote mark, ' . Fourteen 
of the 128 positions are changed from the normal text layout conventions, namely 
codes '013-017, '040 , '042, '074, '076, '134, '137, and '173-T75. All of the 
ligatures are absent, except for the Spanish j and (The characters for Spanish 
ligatures appear in different positions, but that makes no difference to the user, 
because each font tells TgX where to locate its own ligatures.) The Polish 1, the 
dot accent, and the long Hungarian umlaut have disappeared to make room for 
new symbols. In a sense, positions '052 and '055 also differ from the normal 
text conventions: The asterisk is not up as high as usual, and the hyphen is just 
like a minus sign. 

Each character in cmttlO has the same width, namely 0.5em; the spaces 
between words also have this width, and they will not stretch or shrink. TpX puts 
two spaces at the end of each sentence when you are typesetting with a typewriter 
font. (These spacing conventions can be changed by assigning nonzero values to 
\spaceskip and \xspaceskip; or you can assign new values to the \f ontdimen 
parameters, which will be described shortly.) 


Figure 3. Typewriter text font layout, showing cmttlO (\tt). 
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typewriter font 
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xspaceskip 

tt 
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You can see at a glance that the math italic font, cmmilO, is quite different from 
text italic. It contains lowercase Greek letters as well as uppercase ones; this, 
of course, is mathematicians’ Greek, not a text font that would be suitable for 
typesetting classical Greek literature. And if you look closely at the non-Greek 
italic letters, you will find that their proportions and spacing have been changed 
from cmtilO to make them work better in TjgX’s mathematics mode. 

Some special unslanted characters appear in positions '050 -'077 and 
'133-T37 , including “oldstvle numerals” : ‘$\mitl984$’ and ‘$\oldstylel984$’ 
both yield ‘1984’. Some of the characters are intended to be combined with 
others; for example, '054 forms the first part of the symbol P. (See the 
definition of \hookrightarrowin Appendix B.) This portion of the font doesn’t 
deserve the name math italic; it’s really a resting place for characters that don’t 
fit anywhere else. (The author didn’t want to leave any places unfilled, since 
that would tempt people to create incompatible ways to fill them.) 

Plain TgX takes its comma, period, and slash from cmmilO in math 
mode, so that appropriate kerning will be computed in certain formulas that 
would otherwise be spaced poorly. For the correct positioning of math accents 
with this font, you should set its \skewchar to '177 . 


cmmilO 

Greek 

oldstyle numerals 

hookrightarrow 

comma 

period 

slash 

skewchar 

mit 


Figure 4. Math italic font layout, showing cmmilO (\mit, \textfontl). 
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When T^X typesets mathematics it assumes that family 0 contains normal roman 
fonts and that families 1, 2, and 3 contain math italic, math symbol, and math 
extension fonts. The special characters in these fonts are usually given symbolic 
names by a \mathchardef instruction, which assigns a hexadecimal code to the 
symbol. This code has four digits, where the first tells what kind of symbol is 
involved, the second specifies the family, and the other two give the font position. 
For example, 

\mathchardef \11= " 32 1C 

says that \11 is character "1C of the math symbol font (family 2), and that it’s 
a “relation” (class 3). A complete list of the symbolic names provided by the 
plain Tf^X format appears later in this appendix. 

Font cmsylO is plain TgX’s math symbol font, and it contains 128 sym- 
bols laid out as shown below. Its \skewchar should be set to '060 so that math 
accents will be positioned properly over the calligraphic capital letters. 


11 

cmsylO 

skewchar 

calligraphic capital letters 
cal 


Figure 5. Math symbol font layout, showing cmsylO (\cal, \textfont2). 



'0 

'1 

'2 

'3 

'4 

'5 

'6 

7 


'00x 

- 


X 

* 

0 

O 

± 

T 

"Ox 

'Olx 

0 

© 

(3 

O 

© 

0 

O 

• 

'02x 

X 

= 

c 

D 

< 

> 

© 

© 

"lx 

'03x 


PS 

c 

D 

<c 

» 

© 

© 

' 04 x 

< - 

-> 

t 


-H- 

X 

\ 

~ 

"2x 

'05x 

<= 


IT 

D- 

<0 

\ 

X 

oc 

'06x 

/ 

00 

G 


A 

V 

/ 

1 

"3x 

'Olx 

V 

3 

— 1 

0 

1 


T 

A 

'10x 

K 

A 

B 

c 

V 

£ 

T 

Q 

"4x 

'llx 

n 

1 

J 

K, 

C 

M 

X r 

O 

'12x 

V 

Q. 

n 

S 

T 

U 

V 

W 

"5x 

'13x 

X 

y 

z 

u 

n 

l+J 

A 

V 

'Ux 

b 

H 

L 

J 

r 

1 

{ 

} 

"6x 

'15x 

{ 

) 

1 

II 

X 


\ 

l 

'16x 

V 

II 

V 

/ 

u 

n 

C 

□ 

"7x 

'llx 

§ 

t 

t 

If 


❖ 


A 


"8 

"9 

"A 

"B 

"C 

"D 

"E 

"F 




432 Appendix F: Font Tables 


The final font of plain TpX is cmexlO, which includes large symbols and 
pieces that can be used to build even larger ones. For example, arbitrarily large 
left parentheses can be constructed by putting '060 at the top and '100 at the 
bottom, and by using as many copies of '102 as necessary in the middle. Large 
square root signs are made from '164 , '165 , and '166 ; large left braces have four 
component parts: '070, '072, '074, 076. 


cmexlO 

square root signs 
Math extension font 


Figure 6. Math extension font layout, showing cmexlO (\textf ont3). 
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When Tf’X “loads” a font into its memory, it doesn’t look at the actual shapes of 
the characters; it only loads the font metric information (e.g., cmrlO . tfm), which 
includes the heights, widths, depths, and italic corrections, together with infor- 
mation about ligatures and kerning. Furthermore, the metric information that 
comes with a font like cmexlO tells TgX that certain characters form a series; for 
example, all of the left parentheses are linked together in order of increasing size: 
'000, '020, '022, and ' 040 , followed by the extensible left parenthesis, which 
is '060 + ['102] n + TOO . Similarly, the two summation signs {'120 , T30) and 
the three \widehat accents ( T 42 , '143 , '144 ) are linked together. Appendix G 
explains how TgX goes about choosing particular sizes for math delimiters, math 
operators, and math accents. 

Each font also has at least seven \fontdimen parameters, which have 
the following significance and typical values (rounded to two decimal places): 


# 

Meaning Value 

in cmrlO 

cmbxlO 

cmsllO 

cmtilO 

cmttlO 

cmmilO 

1 

slant per pt 

0.00 pt 

0.00 pt 

0.17 pt 

0.25 pt 

0.00 pt 

0.25 pt 

2 

interword space 

3.33 pt 

3.83 pt 

3.33 pt 

3.58 pt 

5.25 pt 

0.00 pt 

3 

interword stretch 

1.67 pt 

1.92 pt 

1.67 pt 

1.53 pt 

0.00 pt 

0.00 pt 

4 

interword shrink 

1.11 pt 

1.28 pt 

1.11 pt 

1.02 pt 

0.00 pt 

0.00 pt 

5 

x-height 

4.31 pt 

4.44 pt 

4.31 pt 

4.31 pt 

4.31 pt 

4.31 pt 

6 

quad width 

10.00 pt 

11.50pt 

10.00 pt 

10.22 pt 

10.50 pt 

10.00 pt 

7 

extra space 

1.11 pt 

1.28 pt 

1.11 pt 

1.02 pt 

5.25 pt 

0.00 pt 


The slant parameter is used to position accents; the next three parameters define 
interword spaces when text is being typeset; the next two define the font-oriented 
dimensions lex and lem; and the last is the additional amount that is added to 
interword spaces at the end of sentences (i.e., when \spacef actor is 2000 or more 
and \xspaceskip is zero). When a font is magnified (using ‘at’ or ‘scaled’), all 
of the parameters except the slant are subject to magnification at the time the 
font is loaded into TgX’s memory. 

Notice that cmmilO has zero spacing. This is the mark of a font that is 
intended only for mathematical typesetting; the rules in Appendix G state that 
the italic correction is added between adjacent characters from such fonts. 

Math symbol fonts (i.e., fonts in family 2) are required to have at least 
22 \f ontdimen parameters instead of the usual seven; similarly, math extension 
fonts must have at least 13. The significance of these additional parameters is 
explained in Appendix G. If you want to increase the number of parameters past 
the number that actually appear in a font’s metric information file, you can assign 
new values immediately after that font has been loaded. For example, if some 
font \ff with seven parameters has just entered TgX’s memory, the command 
\f ontdimenl3\f f =5pt will set parameter number 13 to 5pt; the intervening 
parameters, numbers 8-12, will be set to zero. You can even give more than 
seven parameters to \nullfont, provided that you assign the values before any 
actual fonts have been loaded. 
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Now that the font layouts have all been displayed, it’s time to consider 
the names of the various mathematical symbols. Plain Tf^X defines more than 
200 control sequences by which you can refer to math symbols without having to 
find their numerical positions in the layouts. It’s generally best to call a symbol 
by its name, for then you can easily adapt your manuscripts to other fonts, and 
your manuscript will be much more readable. 

The symbols divide naturally into groups based on their mathematical 
class (Ord, Op, Bin, Rel, Open, Close, or Punct), so we shall follow that order 
as we discuss them. N.B.: Unless otherwise stated, math symbols are available 
only in math modes. For example, if you say ‘\alpha’ in horizontal mode, T13X 
will report an error and try to insert a $ sign. 

1. Lowercase Greek letters. 


a 

\alpha 

i \iota 

Q 

\varrho 

p 

\beta 

k \kappa 

a 

\ sigma 

7 

\gamma 

A \ lambda 

A 

\var sigma 

S 

\delta 

p \mu 

T 

\tau 

e 

\epsilon 

v \nu 

V 

\upsilon 

£ 

\varepsilon 

i \xi 

<P 

\phi 

c 

\zeta 

0 o 

V 

\varphi 

V 

\eta 

TT \pi 

X 

\chi 

9 

\theta 

w \ varpi 

ib 

\psi 

9 

\vartheta 

p \rho 

CO 

\ omega 


There’s no \omicron, because it would look the same as o. Notice that the letter 
\upsilon (v) is a bit wider than v (v); both of them should be distinguished 
from \nu (n). Similarly, \varsigma (<j) should not be confused with \zeta ((). It 
turns out that \varsigma and \upsilon are almost never used in math formulas; 
they are included in plain TgX primarily because they are sometimes needed in 
short Greek citations (cf. Appendix J). 


A Uppercase Greek letters. 


r \ Gamma 
A \Delta 
0 \Theta 
A \Lambda 


E \Xi 
n \Pi 
X \Sigma 
T \Upsilon 


$ \Phi 
4/ \Psi 
O \ Omega 


The other Greek capitals appear in the roman alphabet (\ Alpha = {\rm A}, 
\Beta = {\rm B>, etc.). It’s conventional to use unslanted letters for uppercase 
Greek, and slanted letters for lowercase Greek; but you can obtain (T, A, . . . , 1?) 
by typing $ ({\mit\Gamma} , {\mit\Delta} , \ldots, {\mit\ Omega}) $. 


3. Calligraphic capitals. To get the letters A ... Z that appear in Figure 5, type 
${\cal A}\ldots{\cal Z}$. Several other alphabets are also used with math- 
ematics (notably Fraktur, script, and “blackboard bold”); they don’t come with 
plain TgX, but more elaborate formats like AyVItS-TgX do provide them. 


symbols in math, table 

alpha 

iota 

varrho 

beta 

kappa 

sigma 

gamma 

lambda 

varsigma 

delta 

mu 

tau 

epsilon 

nu 

upsilon 

varepsilon 

xi 

phi 

zeta 

varphi 

eta 

p i 

chi 

theta 

varpi 

psi 

vartheta 

rho 

omega 

omicron 

Gamma 

Xi 

Phi 
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Psi 

Theta 

Sigma 

Omega 

Lambda 

Upsilon 

Alpha 

Beta 

mit 

calligraphic letters 

Fraktur 

script 

blackboard bold 
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4- Miscellaneous symbols of type Ord. 


K 

\aleph 

/ \prime 

V 

\f orall 

h 

\hbar 

0 \emptyset 

3 

\exists 

1 

\ imath 

V \nabla 

— 1 

\neg 

J 

\ jmath 

sj \surd 

b 

\f lat 

£ 

\ell 

T \top 

tl 

\natural 

P 

\wp 

_L \bot 

it 

\ sharp 

K 

\Re 

II \l 

* 

\clubsuit 

9 

\Im 

Z \angle 

<> 

\diamondsuit 

d 

\partial 

A \triangle 

V 

\heartsuit 

00 

\infty 

\ \backslash 

A 

\spadesuit 


The dotless letters \imath and \jmath should be used when i and j are accented; 
for example, $\hat\imath$ yields i. The \prime symbol is intended for use in 
subscripts and superscripts, as explained in Chapter 16, so you usually see it in 
a smaller size. On the other hand, the \angle symbol has been built up from 
other pieces; it does not get smaller when it appears in a subscript or superscript. 

5. Digits. To get italic digits 0123456789, say {\it0123456789}; to get boldface 
digits 0123456789 , say {\bf 0123456789}; to get oldstyle digits 0123456789, say 
{\oldstyle0123456789}. These conventions work also outside of math mode. 

6. “Large” operators. The following symbols come in two sizes, for text and 
display styles: 

X! \ sum 

n n N 'P rod 

nil \coprod 

f J \ int 

<fi (j) \oint 

It is important to distinguish these large Op symbols from the similar but smaller 
Bin symbols whose names are the same except for a ‘big’ prefix. Large operators 
usually occur at the beginning of a formula or subformula, and they usually are 
subscripted; binary operations usually occur between two symbols or subformu- 
las, and they rarely are subscripted. For example, 

$\bigcup_{n=l}~m(x_n\cup y_n)$ yields (J™=j ( x n U Vn) 

The large operators \sum, \prod, \coprod, and \int should also be distin- 
guished from smaller symbols called \Sigma (S), \Pi (II), \amalg (II), and 
\smallint (/), respectively; the \smallint operator is rarely used. 


fl fl \ bi S ca P 

U U \bigcup 

U U \ bi s s q cu P 

V \J \bigvee 

A A \ bi g wed g e 


O 0 \bigodot 
0 (0 \bigotimes 
© 0 \bigoplus 

1+) y \biguplus 


aleph 

prime 

forall 

hbar 

emptyset 

exists 

imath 

nabla 

neg 

jmath 

surd 

flat 

ell 

top 

natural 

wp 

bot 

sharp 

Re 

escvert 

clubsuit 

Im 

angle 

diamondsuit 

partial 

triangle 

heartsuit 

infty 

backslash 

spadesuit 

Weierstrass, see wp 

dotless letters 

accent 

digits 

sum 

bigcap 

bigodot 

prod 

bigcup 

bigotimes 

coprod 

bigsqcup 

bigoplus 

int 

bigvee 

biguplus 

oint 

bigwedge 

binary operations 

smallint 
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7. Binary 

operations. 

Besides + and 

— , you can type 



± 

\pm 

n 

\cap 

V 

\vee 

0 

\mp 

u 

\cup 

A 

\wedge 

\ 

\setminus 

i±) 

\uplus 

® 

\oplus 


\cdot 

n 

\sqcap 

0 

\ominus 

X 

\times 

u 

\sqcup 

O 

\otimes 

* 

\ast 

< 

\triangleleft 

o 

\oslash 

* 

\star 

> 

\triangleright 

0 

\odot 

o 

\diamond 

X 

\wr 

t 

\ dagger 

o 

\circ 

0 

\bigcirc 

t 

\ ddagger 

• 

\bullet 

A 

\bigtriangleup 

II 

\amalg 

-r 

\div 

V 

\bigtriangledown 




It’s customary to say $G\backslashH$ to denote double cosets of G by Ft (G\H), 
and $p\backslash n$ to mean that p divides n (i>\ii I: but $X\setminus Y$ 
denotes the elements of set X minus those of set I' (. X \ I'). Both operations 
use the same symbol, but \backslash is type Ord, while \setminus is type Bin 
(so TgX puts more space around it). 

8. Relations. Besides <, >, and =, you can type 


< 

\leq 

> \geq 

= 

\equiv 

-< 

\prec 

>- \succ 

r>-/ 

\sim 


\preceq 

A \succeq 


\ simeq 

<0 

Ml 

> \gg 

x 

\asymp 

C 

\subset 

D \supset 


\approx 

C 

\subseteq 

D \supseteq 


\cong 

c 

\sqsubseteq 

□ \sqsupseteq 

1X1 

\bowtie 

G 

\in 

9 \ni 

(X 

\propto 

h 

\vdash 

H \dashv 

1= 

\models 

— 

\smile 

| \mid 

= 

\doteq 


\frown 

j \parallel 

_L 

\perp 


The symbols \mid and \parallel define relations that use the same characters 
as you get from | and \ I ; TgX puts space around them when they are relations. 


9. Negated relations. Many of the relations just listed can be negated or “crossed 
out” by prefixing them with \not, as follows: 


t \not< 

■£. \not\leq 
t \not\prec 
t \not\preceq 
(jL \not\subset 
\not\subseteq 
t \not\sqsubseteq 


t \not> 
it \not\geq 
/ \not\succ 
t \not\succeq 
t> \not\supset 
t> \not\supseteq 
jZj \not\sqsupseteq 


t \not= 
t \not\equiv 
t \not\sim 
t \not\simeq 
\not\approx 
^ \not\cong 
t \not\asymp 


pm 

cap 

vee 

mp 

cup 

wedge 

setminus 

uplus 

oplus 

cdot 

sqcap 

ominus 

times 

sqcup 

otimes 

ast 

triangleleft 

oslash 

star 

triangleright 

odot 

diamond 

wr 

dagger 

circ 

bigcirc 

ddagger 

bullet 

bigtriangleup 

amalg 

div 

bigtriangledown 

leq 

geq 

equiv 

prec 

succ 

approx 

preceq 

succeq 

propto 

11 

gg 

asymp 

subset 

supset 

sim 

subseteq 

supseteq 

simeq 

sqsubseteq 

sqsupseteq 

cong 

in 

ni 

bowtie 

vdash 

dashv 

models 

smile 

mid 

doteq 

frown 

parallel 

perp 

not 
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The symbol \not is a relation character of width zero, so it will overlap a relation 
that comes immediately after it. The positioning isn’t always ideal, because some 
relation symbols are wider than others; for example, \not\in gives but it is 
preferable to have a steeper cancellation, The latter symbol is available as a 
special control sequence called \notin. The definition of \notin in Appendix B 
indicates how similar symbols can be constructed. 

10. Arrows. There’s also another big class of relations, namely those that point: 


t— \leftarrow 
<= \Leftarrow 
— > \rightarrow 
=> \Rightarrow 
•f-» \leftrightarrow 
<£> \Leftrightarrow 
i — } \mapsto h 

\hookleftarrow 
\leftharpoonup 
v- \leftharpoondown 
# \rightleftharpoons 


\longleftarrow ]' 

\Longleftarrow f|- 

\longrightarrow j, 

\Longrightarrow 1J. 

\longleftrightarrow t 
\Longleftrightarrow ^ 
\longmapsto 
\hookrightarrow \ 

\rightharpoonup 
\rightharpoondown ’X 


\uparrow 

\Uparrow 

\downarrow 

\Downarrow 

\updownarrow 

\Updownarrow 

\nearrow 

\searrow 

\swarrow 

\nwarrow 


Up and down arrows will grow larger, like delimiters (see Chapter 17). To put 
symbols over left and right arrows, plain T^X provides a \buildrel macro: You 
type \buildrel(superscript)\over(relation), and the superscript is placed on 
top of the relation just as limits are placed over large operators. For example, 


def 


\buildrel \alpha\beta \over \longrightarrow 
\buildrel \rm def \over = 


(In this context, ‘\over’ does not define a fraction.) 

11. Openings. The following left delimiters are available, besides ‘(’: 

[ \lbrack [ \lfloor [ \lceil 

{ \lbrace ( \langle 

You can also type simply ‘ [’ to get \lbrack. All of these will grow if you prefix 
them by \bigl, \Bigl, \biggl, \Biggl, or \left. Chapter 17 also mentions 
\lgroup and \lmonstache, which are available in sizes greater than \big. If 
you need more delimiters, the following combinations work reasonably well in 
the normal text size: 


| \lbrack\ ! \lbrack (( \langle\ ! \langle (( (\!( 

12. Closings. The corresponding right delimiters are present too: 

] \rbrack j \rfloor ] \rceil 

} \rbrace ) \rangle 

Everything that works for openings works also for closings, but reversed. 


notin 

arrows 

leftarrow 

longleftarrow 

uparrow 

Leftarrow 

Longleftarrow 

Uparrow 

rightarrow 

longrightarrow 

downarrow 

Rightarrow 

Longrightarrow 

Downarrow 

leftrightarrow 

longleftrightarrow 

up downarrow 

Leftrightarrow 

Longleftrightarrow 

Updownarrow 

mapsto 

longmapsto 

nearrow 

hookleftarrow 

hookrightarrow 

searrow 

leftharpoonup 

rightharpoonup 

swarrow 

leftharpoondown 

rightharpoondown 

nwarrow 

rightleftharpoons 

buildrel 

over 

left delimiters 

lbrack 

lbrace 

langle 

lfloor 

lceil 

bigl 

Bigl 

biggl 

Biggl 

left 

lgroup 

lmoustache 

rbrack 

rbrace 

rangle 

rfloor 

rceil 
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13. Punctuation. TjnjX puts a thin space after commas and semicolons that ap- 
pear in mathematical formulas, and it does the same for a colon that is called 
\colon. (Otherwise a colon is considered to be a relation, as in ‘x := y’ and 
‘a. : b :: c : d\ which you type by saying ‘$x:=y$’ and ‘$a:b: :c:d$’.) Examples 
of \colon are 

f'.A^B $f\colon A\rightarrow B$ 

L(a,b; c: x,y; z) $L(a,b;c\colon x,y;z)$ 

Plain TgX also defines \ldotp and \cdotp to be and ••’ with the spacing of 
commas and semicolons. These symbols don’t occur directly in formulas, but 
they are useful in the definition of \ldots and \cdots. 

14- Alternate names. If you don’t like plain TgjX’s name for some math symbol — 
for example, if there’s another name that looks better or that you can remember 
more easily — the remedy is simple: You just say, e.g., ‘\let\cupcap=\asymp’. 
Then you can type ‘f (n)\cupcap n’ instead of ‘f (n)\asymp n’. 

Some symbols have alternate names that are so commonly used that 
plain TgjX provides two or more equivalent control sequences: 

\ne or \neq (same as \not=) 

< \le (same as \leq) 

> \ge (same as \geq) 

{ \{ (same as \lbrace) 

} \} (same as \rbrace) 

— » \to (same as \rightarrow) 

<— \gets (same as \leftarrow) 

3 \owns (same as \ni) 

A \land (same as \wedge) 

V \lor (same as \vee) 

-1 \lnot (same as \neg) 

j \vert (same as I) 

|| \Vert (same as \|) 

There’s also \iff ( ), which is just like \Longleftrightarrow except that 

it puts an extra thick space at each side. 

15. Non-math symbols. Plain TgX makes four special symbols available outside 
of math mode, although the characters themselves are actually typeset from the 
math symbols font: 

§ \S 

If \P 

t \dag 

t \ddag 

These control sequences do not act like ordinary math symbols; they don’t change 
their size when they appear in subscripts or superscripts, and you must say, e.g., 


colon 

colon 

ldotp 

cdotp 

ldots 

cdots 

ne 

neq 

le 


ge 

to 

gets 

owns 

land 

lor 

lnot 

vert 

iff 
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$x~{\P}$ instead of $x~\P$ when you use them in formulas. However, the \dag 
and \ddag symbols are available in math mode under the names \dagger and 
\ddagger. It would be easy to define mathematical equivalents of \S and \P, 
if these symbols suddenly caught a mathematician’s fancy. 


Seek not for fresher founts afar, 
Just drop your bucket where you are. 
— SAM WALTER FOSS, Back Country Poems (1892) 

No one compositor will have all the signs and symbols available. 
The number of special signs and symbols is almost limitless, 
with new ones being introduced all the time. 
— UNIVERSITY OF CHICAGO PRESS, Manual of Style (1969) 


dag 

ddag 

dagger 

ddagger 

S 

P 

FOSS 

CHICAGO 
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People who define new math fonts and/or macros sometimes need to know exactly 
how T)gX manipulates the constituents of formulas. The purpose of this appendix is to 
explain the precise positioning rules by which T[-'X converts a math list into a horizontal 
list. (It is a good idea to review the introduction to math lists in Chapter 17 before 
reading further; “double dangerous bends” are implied throughout this appendix.) 

TftX relies on lots of parameters when it typesets formulas, and you have the 
option of changing any or all of them. But of course you will want to know what each 
parameter means, before you change it. Therefore each rule below is numbered, and a 
table appears at the end to show which rules depend on which parameters. 

The most important parameters appear in the symbol fonts (family 2) and 
the extension fonts (family 3). TjgX will not typeset a formula unless \textfont2, 
\scriptf ont2, and \scriptscriptf ont2 each contain at least 22 \fontdimen param- 
eters. For brevity we shall call these parameters <ti to < 722 , where the parameter is 
taken from \textfont2 if the current style is display or text ( D or D' or T or T'), from 
\scriptfont2 if the current style is S or S' , and from \scriptscriptf ont2 otherwise. 
Similarly, the three fonts in family 3 must each have at least 13 \f ontdimen parameters, 
and we will denote them by £1 to £ 13 . The notation £ 9 , for example, stands for the 
ninth parameter of \scriptfont3, if Tj/X is typesetting something in \scriptstyle. 

A math list is a sequence of items of the various kinds listed in Chapter 17, 
and TgX typesets a formula by converting a math list to a horizontal list. When such 
typesetting begins, T)gX has two other pieces of information in addition to the math list 
itself, (a) The starting style tells what style should be used for the math list, unless 
another style is specified by a style item. For example, the starting style for a displayed 
formula is D, but for an equation in the text or an equation number it is T; and for a 
subformula it can be any one of the eight styles defined in Chapter 17. We shall use C 
to stand for the current style, and we shall say that the math list is being typeset in 
style C. (b) The typesetting is done either with or without penalties. Formulas in the 
text of a paragraph are converted to horizontal lists in which additional penalty items 
are inserted after binary operations and relations, in order to aid in line breaking. Such 
penalties are not inserted in other cases, because they would serve no useful function. 

The eight styles are considered to be D > D' > T > T' > S > S' > SS > SS' , 
in decreasing order. Thus, C < S means that the current style is S, S', SS, or SS' . 
Style C' means the current style with a prime added if one isn’t there; for example, we 
have C’ = T' if and only if C = T or C = T' . Style Cf is the superscript style for C: 
this means style S if C is D or T , style S' if C is D' or T' , style SS if C is S or SS, 
and style SS' if C is S' or SS' . Finally, style C\. is the subscript style, which is (C't) , • 

Chapter 17 stated that the most important components of math lists are called 
atoms, and that each atom has three fields called its nucleus, subscript, and superscript. 
We frequently need to execute a subroutine called “Set box x to the so-and-so field in 
style such-and-such.” This means (a) if the specified field is empty, x is set equal to a 
null box; (b) if the field contains a symbol, x is set to an hbox containing that symbol 
in the appropriate size, and the italic correction for the character is included in the 
width of the box; (c) if the field contains a math list or horizontal list, x is set to 
an hbox containing the result of typesetting that list with the specified starting style. 
In case (c), the glue is set with no stretching or shrinking, and an additional level of 
liboxing is omitted if it turns out to be redundant. 

Another subroutine sets box itoa specified variable delimiter, having a spec- 
ified minimum height plus depth. This means that a search is conducted as follows: 
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math lists 
symbol fonts 
extension fonts 
textfont 
scriptfont 
scriptscriptfont 
fontdimen 
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italic correction 
delimiter 
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The delimiter is defined by two symbols, a “small character” a in family / and a “large 
character” b in family g. The search looks first at character a in scriptscriptfont /, if 
C < SS; then it looks at a in scriptfont /, if C < S ; then it looks at a in textfont /. If 
nothing suitable is found from a and /, the larger alternative b and g is examined in 
the same way. Either (a, f) or (b, g) may be (0, 0), which means that the corresponding 
part of the search is to be bypassed. When looking at a character in a font, the search 
stops immediately if that character has sufficient height plus depth, or if the character 
is extensible; furthermore, if the character does not stop the search, but if it has a 
successor in the font, the successor is looked at next. (See the METRFONT manual 
or the system documentation of tfm files for further information about successors and 
extensible characters.) If the search runs all the way to completion without finding a 
suitable character, the one with greatest height plus depth is chosen. If no characters 
at all were found (either because a = f = b = g = 0 or because the characters did not 
exist in the fonts), x is set to an empty box whose width is \nulldelimiterspace. If 
an extensible character was found, x is set to a vbox containing enough pieces to build 
up a character of sufficient size; the height of this vbox is the height of the topmost 
piece, and the width is the width of the repeatable piece. Otherwise x is set to an 
libox containing the character that was found; the italic correction of the character is 
included in the width of this box. 

There’s also a subroutine that “reboxes” a given box to a given width. If the 
box doesn’t already have the desired width, TjgX unpackages it (unless it was a vbox), 
then adds a kern for an italic correction if one was implied, and inserts \hss glue at 
both left and right; the resulting horizontal list is packaged into an libox. This process 
is used, for example, to give a common width to the numerator and denominator of a 
fraction; it centers whichever is smaller, unless infinite glue is present in addition to 
the newly added \hss. 

If a; is a box, we shall use the abbreviations h(x), d( x), and w(x) for its height, 
depth, and width, respectively. 

Here now are the rules for typesetting a given math list in starting style C. 
The process applies from left to right, translating each item in turn. Two passes are 
made over the list; most of the work is done by the first pass, which compiles individual 
translations of the math items. We shall consider this part of the task first: 

1. If the current item is a rule or discretionary or penalty or “whatsit” or bound- 
ary item, simply leave it unchanged and move to the next item. 

2. If the current item is glue or a kern, translate it as follows: If it is glue from 
\nonscript, check if the immediately following item is glue or a kern; and if so, remove 
that item if C < S. Otherwise, if the current item is from \mskip or \mkern, convert 
from mu to absolute units by multiplying each finite dimension by yg<7e. Then move on 
to the next item. 

3. If the current item is a style change, set C to the specified style. Delete the 
current item from the list and move on to the next. 

4. If the current item is a four-way choice, it contains four math lists for the four 
main styles. Replace it by the math list that corresponds to the current style C, then 
move to the first unprocessed item. 

5 . If the current item is a Bin atom, and if this was the first atom in the list, or 
if the most recent previous atom was Bin, Op, Rel, Open, or Punct, change the current 
Bin to Ord and continue with Rule 14. Otherwise continue with Rule 17. 


extensible 

successor 

METAFONT 

nulldelimiterspace 

built-up characters 

hss 

nonscript 

mskip 

mkern 
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6 . If the current item is a Rel or Close or Punct atom, and if the most recent 
previous atom was Bin, change that previous Bin to Ord. Continue with Rule 17. 

7. If the current item is an Open or Inner atom, go directly to Rule 17. 

8 . If the current item is a Vcent. atom (from \vcenter), let its nucleus be a vbox 
of height-plus-depth v. Change the height to ~v + a and the depth to \v — a, where 
a is the axis height, <T 22 - Change this atom to type Ord and continue with Rule 17. 

9. If the current item is an Over atom (from \overline), set box x to the nucleus 
in style C' . Then replace the nucleus by a vbox containing kern 6 , hrule of height 9 , 
kern 3 6, and box x , from top to bottom, where 9 = £ s is the default rule thickness. 
(This puts a rule over the nucleus, with 3 9 clearance, and with 9 units of extra white 
space assumed to be present above the rule.) Continue with Rule 16. 

10 . If the current item is an Under atom (from \underline), set box x to the 
nucleus in style C. Then replace the nucleus by a vtop made from box x , kern 3 9, and 
hrule of height 9 , where 9 = £s is the default rule thickness; and add 9 to the depth of 
the box. (This puts a rule under the nucleus, with 3 9 clearance, and with 9 units of 
extra white space assumed to be present below the rule.) Continue with Rule 16. 

11 . If the current item is a Rad atom (from \radical, e.g., \sqrt), set box x to 
the nucleus in style C' . Let 9 = £s; and let ip = <75 if C > T, otherwise tp = 9. Set 
V’ = 9+ \\p[, this is the minimum clearance that will be allowed between box x and the 
rule that will go above it. Set box y to a variable delimiter for this radical atom, having 
height plus depth h(x) -f d(x) + if + 9 or more. Then set 9 <— h(y)\ this is the thickness 
of the rule to be used in the radical construction. (Note that the font designer specifies 
the thickness of the rule by making it the height of the radical character; the baseline of 
the character should be precisely at the bottom of the rule.) If d(y ) > h(x) +d(x) + ip, 
increase i[> by half of the excess; i.e., set 1 /> t— |(i/’ + d(y) — h(x) — d(x)). Construct a 
vbox consisting of kern 9, hrule of height 9, kern i/g and box x , from top to bottom. 
The nucleus of the radical atom is now replaced by box y raised by h(x ) + i/>, followed 
by the new vbox. Continue with Rule 16. 

12 . If the current item is an Acc atom (from \mathaccent), just go to Rule 16 
if the accent character doesn’t exist in the current size. Otherwise set box x to the 
nucleus in style C' , and set u to the width of this box. If the nucleus is not a single 
character, let s = 0 ; otherwise set s to the kern amount for the nucleus followed by the 
\skewchar of its font. If the accent character has a successor in its font whose width 
is < u, change it to the successor and repeat this sentence. Now set S <— min(/i(x), y), 
where \ * s \fontdimen5 (the x-height) in the accent font. If the nucleus is a single 
character, replace box a; by a box containing the nucleus together with the superscript 
and subscript of the Acc atom, in style (7, and make the sub/superscripts of the Acc 
atom empty; also increase <5 by the difference between the new and old values of h(x). 
Put the accent into a new box y, including the italic correction. Let z be a vbox 
consisting of: box y moved right s 4- |(u, — w(y)), kern —5, and box x. If h(z) < h(x), 
add a kern of h(x) — h{z) above box y and set h(z) <— h(x). Finally set w(z) «— w(x), 
replace the nucleus of the Acc atom by box z, and continue with Rule 16. 

13 . If the current item is an Op atom, mark this atom as having limits if it has 
been marked with \limits, or if it has been marked with \displaylimits and C > T . 
If the nucleus is not a symbol, set S <— 0 and go to Rule 13a. Otherwise if C > T and 
if the nucleus symbol has a successor in its font, move to the successor. (This is where 
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operators like ^2 and / change to a larger size in display styles.) Put the symbol into 
a new box x, in the current size, and set <5 to the italic correction for the character; 
include <5 in the width of box x if and only if limits are to be set or there is no subscript. 
Shift box x down by ^(h(x) — d(x)) — a, where a = 022 , so that the operator character 
is centered vertically on the axis; this shifted box becomes the nucleus of the Op atom. 

13a. If limits are not to be typeset for this Op atom, go to Rule 17; otherwise the 
limits are attached as follows: Set box x to the superscript field in style C f; set box y 
to the nucleus field in style C; and set box 2 to the subscript field in style Cf. Rebox 
all three of these boxes to width max(i«(i), w(y ), w(z)). If the superscript field was not 
empty, attach box x above box y , separated by a kern of size max(£g,£n — d(x)), and 
shift box x right by |<5; also put a kern of size £13 above box x. If the subscript field was 
not empty, attach box 2 below box y, separated by a kern of size max( 5 io, £12 — h{z)), 
and shift box 2 left by |(5; also put a kern of size £13 below box 2 . The resulting vbox 
becomes the nucleus of the current Op atom; move to the next item. 

14. If the current item is an Ord atom, go directly to Rule 17 unless all of the 
following are true: The nucleus is a symbol; the subscript and superscript are both 
empty; the very next item in the math list is an atom of type Ord, Op, Bin, Rel, Open, 
Close, or Punct; and the nucleus of the next item is a symbol whose family is the same 
as the family in the present Ord atom. In such cases the present symbol is marked as 
a text symbol. If the font information shows a ligature between this symbol and the 
following one, using the specified family and the current size, then insert the ligature 
character and continue as specified by the font; in this process, two characters may 
collapse into a single Ord text symbol, and/or new Ord text characters may appear. If 
the font information shows a kern between the current symbol and the next, insert a 
kern item following the current atom. As soon as an Ord atom has been fully processed 
for ligatures and kerns, go to Rule 17. 

15. If the current item is a generalized fraction (and it had better be, because 
that’s the only possibility left if Rules 1-14 don’t apply), let 9 be the thickness of 
the bar line and let (A , p) be the left and right delimiters. If this fraction was gen- 
erated by \over or \overwithdelims, then 6 = £g; if it was generated by \atop or 
\atopwithdelims, 6 = 0; otherwise it was generated by \above or \abovewithdelims, 
and a specific value of 9 was given at that time. The values of A and p are null unless 
the fraction is “with delirns.” 

15a. Put the numerator into box x , using style T or T' if G is D or D' , otherwise 
using style C f- Put the denominator into box 2 , using style T' if C > T, otherwise using 
Cf. If w(x) < w(z), rebox x to width w(z); if w{z) < w(x), rebox 2 to width w{x). 

15b. If C > T, set u <— erg and v <— mi- Otherwise set u <— <79 or mo, according 
as 9 yf 0 or 9 = 0, and set v <— m 2 . (The fraction will be typeset with its numer- 
ator shifted up by an amount u with respect to the current baseline, and with the 
denominator shifted down by v, unless the boxes are unusually large.) 

15c. If 9 = 0 (\atop), the numerator and denominator are combined as follows: 
Set (f 7 <— 7£g or 3£g, according as C > T or C < T : ip is the minimum clearance that 
will be tolerated between numerator and denominator. Let </> = (u — d(x)) — (h(z) — v) 
be the actual clearance that would be obtained with the current values of u and v; if 
i/> < i p, add \{(p — -(/’) to both u and v. Then construct a vbox of height h(x) 4- u and 
depth d(z) + v, consisting of box x followed by an appropriate kern followed by box 2 . 
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15d. If 6 ^ 0 (\over), the numerator and denominator are combined as follows: 
Set tp <— 3$ or 9, according as C > T or C < T: tp is the minimum clearance that 
will be tolerated between numerator or denominator and the bar line. Let a = <722 
be the current axis height; the middle of the bar line will be placed at this height. If 
( u — d(x)) — (a + j 9 ) < tp, increase u by the difference between these quantities; and 
if (a — |$) — (h(z) — v) < <p, increase v by the difference. Finally construct a vbox of 
height h(x) + u and depth d{z) + V, consisting of box x followed by a kern followed by 
an hrule of height 8 followed by another kern followed by box s, where the kerns are 
figured so that the bottom of the hrule occurs at a — \8 above the baseline. 

15e. Enclose the vbox that was constructed in Rule 15c or 15d by delimiters whose 
height plus depth is at least < 720 , if C > T, and at least <721 otherwise. Shift the delim- 
iters up or down so that they are vertically centered with respect to the axis. Replace 
the generalized fraction by an Inner atom whose nucleus is the resulting sequence of 
three boxes (left delimiter, vbox, right delimiter). 

Rules 1-15 account for the preliminary processing of math list items; but we still haven’t 
specified how subscripts and superscripts are to be typeset. Therefore some of those 
rules lead to the following post-process: 

16. Change the current item to an Ord atom, and continue with Rule 17. 

17. If the nucleus of the current item is a math list, replace it by a box obtained 
by typesetting that list in the current style. Then if the nucleus is not simply a symbol, 
go on to Rule 18. Otherwise we are in the common case that a math symbol is to be 
translated to its horizontal-list equivalent: Convert the symbol to a character box for 
the specified family in the current size. If the symbol was not marked by Rule 14 above 
as a text symbol, or if \fontdimen parameter number 2 of its font is zero, set (5 to the 
italic correction; otherwise set S to zero. If <5 is nonzero and if the subscript field of the 
current atom is empty, insert a kern of width <f after the character box, and set <f to 
zero. Continue with Rule 18. 

18. (The remaining task for the current atom is to attach a possible subscript and 
superscript.) If both subscript and superscript fields are empty, move to the next item. 
Otherwise continue with the following subrules: 

18a. If the translation of the nucleus is a character box, possibly followed by a kern, 
set u and v equal to zero; otherwise set u t— h — q and v <— d + r, where h and d are 
the height and depth of the translated nucleus, and where q and r are the values of 
<7is and <719 in the font corresponding to styles Cf and Cf. (The quantities u and v 
represent minimum amounts by which the superscript and subscript will be shifted up 
and down; these preliminary values of u and v may be increased later.) 

18b. If the superscript field is empty (so that there is a subscript only), set box x 
to the subscript in style Cf, and add \scriptspace to w(x). Append this box to the 
translation of the current item, shifting it down by max(n, <7i6, h(x) — 1 1 <75 1 ) , and move 
to the next item. (The idea is to make sure that the subscript is shifted by at least v 
and by at least <7ie; furthermore, the top of the subscript should not extend above | of 
the current x-heiglit.) 

18c. Set box x to the superscript field in style C f, and add \scriptspace to w(x). 
Then set u «— max(u, p, d(x) + 1 < 75 1 ) , where p = <713 if C = D, p = <715 if C = C' , and 
p = <7 14 otherwise; this gives a tentative position for the superscript. 
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18d. If the subscript field is empty (so that there is a superscript only), append 
box x to the translation of the current atom, shifting it up by u, and move to the next 
item. Otherwise (i.e., both subscript and superscript are present), set box y to the 
subscript in style Cf, add \scriptspace to w(y ), and set v <— max(t>, an). 

18e. (The remaining task is to position a joint subscript /superscript combination.) 
Let 9 = §s be the default rule thickness. If (u — d(x)) — (h(y) — v) > 46, go to Rule 18f. 
(This means that the white space between subscript and superscript is at least 49.) 
Otherwise reset v so that (u — d(.r>) — ( h(y ) — v) = 49. Let ip = ||<r 5 | — ( u — d(x)). 
If ip > 0, increase u by ip and decrease v by ip. (This means that the bottom of the 
superscript will be at least as high above the baseline as | of the x-height.) 

18f. Finally, let S be zero unless it was set to a nonzero value by Rules 13 or 17. 
(This is the amount of horizontal displacement between subscript and superscript.) 
Make a vbox of height h(x) + u and depth d(y) + v, consisting of box x shifted right 
by d, followed by an appropriate kern, followed by box y. Append this vbox to the 
translation of the current item and move to the next. 

After the entire math list has been processed by Rules 1-18, TjgX looks at the last atom 
(if there was one), and changes its type from Bin to Ord (if it was of type Bin). Then 
the following rule is performed: 

19. If the math list begins and ends with boundary items, compute the maximum 
height h and depth d of the boxes in the translation of the math list that was made on 
the first pass, taking into account the fact that some boxes may be raised or lowered. 
Let a = <722 be the axis height, and let <5 = max(h — a, d+a) be the amount by which the 
formula extends away from the axis. Replace the boundary items by delimiters whose 
height plus depth is at least max( [(5/500J/, 2S — l) , where / is the \delimiterfactor 
and l is the \delimitershortf all. Shift the delimiters up or down so that they are 
vertically centered with respect to the axis. Change the left boundary item to an Open 
atom and the right boundary item to a Close atom. (All of the calculations in this 
step are done with C equal to the starting style of the math list; style items in the 
middle of the list do not affect the style of the right boundary item. ) 

20. Rules 1-19 convert the math list into a sequence of items in which the only 
remaining atoms are of types Ord, Op, Bin, Rel, Open, Close, Punct, and Inner. After 
that conversion is complete, a second pass is made through the entire list, replacing 
all of the atoms by the boxes and kerns in their translations. Furthermore, additional 
inter-element spacing is inserted just before each atom except the first, based on the 
type of that atom and the preceding one. Inter-element spacing is defined by the three 
parameters \thinmuskip, \medmuskip, and \thickmuskip; the mu units are converted 
to absolute units as in Rule 2 above. Chapter 18 has a chart that defines the inter- 
element spacing, some of which is \nonscript, i.e., it is inserted only in styles > S. 
The list might also contain style items, which are removed during the second pass; they 
are used to change the current style just as in the the first pass, so that both passes 
have the same value of C when they work on any particular atom. 

21. Besides the inter-element spacing, penalties are placed after the translation of 
each atom of type Bin or Rel, if the math list was part of a paragraph. The penalty 
after a Bin is \binoppenalty, and the penalty after a Rel is \relpenalty. However, 
the penalty is not inserted after the final item in the entire list, or if it has a numeric 
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value > 10000, or if the very next item in the list is already a penalty item, or after a 
Rel atom that is immediately followed by another Rel atom. 

22. After all of the preceding actions have been performed, the math list has 
been totally converted to a horizontal list. If the result is being inserted into a larger 
horizontal list, in horizontal mode or restricted horizontal mode, it is enclosed by 
“math-on” and “matli-off” items that each record the current value of \mathsurround. 
Or if this list is a displayed formula, it is processed further as explained in Chapter 19. 

Summary of parameter usage. Here is the promised index that refers to everything 
affected by the mysterious parameters in the symbol fonts. Careful study of the rules 
allows you to get the best results by appropriately setting the parameters for new 
fonts that you may wish to use in mathematical typesetting. Each font parameter has 
an external name that is used in supporting software packages; for example, <Ti4 is 
generally referred to as ‘sup2’ and £g as ‘default_rule_thickness\ These external names 
are indicated in the table. 


Parameter Used in Parameter Used in 
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space 

17 
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05 
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11, 18b, 18c, 18e 
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sup -drop 

18a 

06 

quad 
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18a 
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numl 

15b 
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deliml 
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15b 

021 

delim2 

15# 

010 

num3 

15b 

022 

axis-height 

8, 13, 15d, 19 

011 

denoml 

15b 

£a 

default_rule_thickness 

9,10,11,15,15c, 18e 

012 

denom2 

15b 

$9 

big_op_spacingl 

13a 

013 

supl 

18c 

6o 

big_op_spacing2 

13a 

014 

sup2 

18c 

£n 

big_op_spacing3 

13a 

015 

sup3 

18c 

£l2 

big_op_spacing4 

13a 

016 

subl 

18b 

£l3 

big_op_spacing5 

13a 


Besides the symbol and extension fonts (families 2 and 3), the rules above also refer 
to parameters in other families: Rule 17 uses \fontdimen parameter 2 (space) to de- 
termine whether to insert an italic correction between adjacent letters, and Rule 12 
uses parameter 5 (x_height) to position an accent character. Several non-font parame- 
ters also affect mathematical typesetting: dimension parameters \delimitershortf all 
(Rule 19), \nulldelimiterspace (in the construction of variable delimiters for Rules 
11, 15e, 19), \mathsurround (Rule 22), and \scriptspace (Rules 18bcd); integer pa- 
rameters \delimiterf actor (Rule 19), \binoppenalty (Rule 21), and \relpenalty 
(Rule 21); muglue parameters \thinmuskip, \medmuskip, and \thickmuskip (Rule 20). 


Woe to the author who always wants to teach! 
The secret of being a bore is to tell everything. 
— VOLTAIRE, De la Nature de I’Homme (1737) 

Very few Compositors are fond of Algebra, 
and rather chuse to be employed upon plain work. 
— PHILIP LUCKOMBE, The History and Art of Printing (1770) 
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It’s better to break a word with a hyphen than to stretch interword spaces too 
much. Therefore TpX tries to divide words into syllables when there’s no good 
alternative available. 

But computers are notoriously bad at hyphenation. When the type- 
setting of newspapers began to be fully automated, jokes about “therapists who 
pre-ached on wee-knights” soon began to circulate. 

It’s not hard to understand why machines have behaved poorly at this 
task, because hyphenation is quite a difficult problem. For example, the word 
‘record’ is supposed to be broken as ‘rec-ord’ when it is a noun, but ‘re-cord’ 
when it is a verb. The word ‘hyphenation’ itself is somewhat exceptional; if 
‘hy-phen-a-tion’ is compared to similar words like ‘con-cat-e-na-tion’, it’s not 
immediately clear why the ‘n’ should be attached to the ‘e’ in one case but not 
the other. Examples like ‘dem-on-stra-tion’ vs. ‘de-mon-stra-tive’ show that the 
alteration of two letters can actually affect hyphens that are nine positions away. 

A good solution to the problem was discovered by Frank M. Liang dur- 
ing 1980-1982, and TgX incorporates the new method. Liang’s algorithm works 
quickly and finds nearly all of the legitimate places to insert hyphens; yet it 
makes few if any errors, and it takes up comparatively little space in the com- 
puter. Moreover, the method is flexible enough to be adapted to any language, 
and it can also be used to hyphenate words in two languages simultaneously. 
Liang’s Ph.D. thesis, published by Stanford University’s Department of Com- 
puter Science in 1983, explains how to take a dictionary of hyphenated words 
and teach it to TpX : i.e., it explains how to compute tables by which TpX will be 
able to reconstruct most of the hyphens in the given dictionary, without error. 

TgjX hyphenates a given word by first looking for it in an “exception 
dictionary,” which specifies the hyphen positions for words that deserve special 
treatment. If the word isn’t there, T^X looks for patterns in the word, and this 
is the key idea underlying Liang’s method. Here’s how it works, using the word 
‘hyphenation’ as an example, when TpX is operating with the English-oriented 
patterns of plain TgX format: The given word is first extended by special markers 
at either end; in this case we obtain 

.hyphenation. 

if ‘ . ’ denotes the special marker. The extended word has subwords 

. hyphenation . 
of length one, 

•h hy yp ph he en na at ti io on n. 

of length two, 

•hy hyp yph phe hen ena nat ati tio ion on. 

of length three, and so on. Each subword of length A: is a pattern that defines 
k + 1 small integer values relating to the desirability of hyphens in the positions 
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between and adjacent to its letters. We can show these values by attaching them 
as subscripts; for example, ‘oh 0 e 2 n 0 ’ means that the values corresponding to the 
subword ‘hen’ are 0, 0, 2, and 0, where the 2 relates to hyphens between the ‘e’ 
and the ‘n’. The interletter values are entirely zero for all subwords except those 
that match an entry in TgX’s current “pattern dictionary” ; and in this case, only 
the subwords 

ohoY3Poho ohoe2no ohoeo n o a i 

ohoeongaoto inoao o n 2 a o' t o i'toio°o 2io°o o 0 2 a o 

occur as special patterns. TpjX now computes the maximum interletter value 
that occurs at each subword touching each interletter position. For example, 
between ‘e’ and ‘n’ there are four relevant values in this case (2 from 0 hoe 2 n 0 , 

0 from ohoeo n o a 4i 0 from ohoeonsaotcn and 1 from inoao); the maximum of 
these is 2. The result of all the maximizations is 

• ohoy3Pohoe2nsa4t 2 ioo 2 no . 

Now comes the final step: A hyphen is considered to be acceptable between 
two letters if the associated interletter value is odd. Thus, two potential break- 
points have been found: ‘hy-phen-ation’. Similarly, the word ‘concatenation’ 
contains the patterns 

o°2 n o o°o n i c o icoao inoao o n 2 a o' t o itoio°o 2io°o o°2no 

and this yields ‘ocoo 2 nicoaotoein 2 ait 2 ioo 2 no’, i.e., ‘con-cate-na-tion’. 

Let’s try a 34-letter word: ‘supercalifragilisticexpialidocious’ 
matches the plain TgX patterns 

uipe ric ica alii agii gil.-i ilii il-jist isiti st 2 i 
sitic iexp X 3 P pi 3 a 2 iia i 2 al 2 id ido ici 2 io 2 us 

(where subscripts that aren’t shown are zero), and this yields 

• o s o u iPo e o r i c o a oliiof o r o a ogiiol 4 io s i' t 2 io c i e o x 3 P 2 i 3 a ol 2 iidoOic 2 ioo 2 uo s o • 

The resulting hyphens ‘su-per-cal-ifrag-ilis-tic-ex-pi-ali-do-cious’ 
agree with Random House’s Unabridged Dictionary (which also shows a few 
more: ‘su-per-cal-i-frag-i-lis-tic-ex-pi-al-i-do-cious’). 

Plain TgX loads exactly 4447 patterns into TgX’s memory, beginning 
with ‘0 . o a o c oh.i’ and ending with ‘ 4 ziz 2 ’ and ‘oZ 4 Zoyo’- The interletter values in 
these patterns are all between 0 and 5; a large odd value like the 5 in ‘ 0 h 5 eoloOo’ 
forces desirable hyphen points in words like ‘bach-e-lor’ and ‘ech-e-lon’, while 
a large even value like the 4 in ‘ 0 hoaoCoh 4 ’ suppresses undesirable hyphens in 
words like ‘tooth-aches’. Liang derived these patterns by preparing a special 
version of Webster’s Pocket Dictionary (Merriam, 1966) that contains about 
50,000 words including derived forms. Then he checked a preliminary set of 
patterns obtained from this data against an up-to-date hyphenation dictionary of 
about 115,000 words obtained from a publisher; errors found in this run led to the 
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addition of about 1000 words like camp-fire, Af-ghan-i-stan, and bio-rhythm 
to the pocket dictionary list. He weighted a few thousand common words more 
heavily so that they would be more likely to be hyphenated; as a result, the 
patterns of plain TgX guarantee complete hyphenation of the 700 or so most 
common words of English, as well as common technical words like al-go-rithm. 
These patterns find 89.3% of the hyphens in Liang’s dictionary as a whole, and 
they insert no hyphens that are not present. 

Patterns derived from the common words of a language tend to work 
well on uncommon or newly coined words that are not in the original dictionary. 
For example, Liang’s patterns find a correct subset of the hyphens in the word 
that all of today’s unabridged dictionaries agree is the longest in English, namely 

pneu-monoul-tra-mi-cro-scop-ic-sil-i-co-vol-canoco-nio-sis . 

They even do fairly well on words from other languages that aren’t too dis- 
tant from English; for example, the pseudo-German utterances of Mark Twain’s 
Connecticut Yankee come out with only six or seven bad hyphens: 

Con-stanti-nop-o-li-tanis-cher- 

dudel-sack-spf eif en-mach-ers-ge-sellschaf ft ; 
Ni-hilis-ten-dy-na-mitthe- 

aterkaestchensspren-gungsat-ten-taetsver-suchun-gen; 

Transvaal-trup-pen-tropen-trans-port- 

tram-pelth-iertreib-er-trau-ungsthrae-nen-tra-goedie ; 
Mekka-musel-man-nen-massen-menchen- 

mo-er-der-mohren-mut-ter-mar-mor-mon-u-menten-machen. 

But when plain TgX is tried on the name of a famous Welsh city, 

Llan-f air-p-wll-gwyn-gyll-gogerych- 

wyrn-drob-wl-l-l-lan-tysil-i-o-gogogoch, 

linguistic differences became quite evident, since the correct hyphens are 

Llan-f air-pwll-gwyn-gyll-go-ger-y- 

chwyrn-dro-bwll-llan-ty-sil-i-o-go-go-goch. 

Appropriate pattern values for other languages can be derived by applying 
Liang’s method to suitable dictionaries of hyphen points. 

Dictionaries of English do not always agree on where syllable boundaries 
occur. For example, the American Heritage Dictionary says ‘in-de-pend-ent’ 
while Webster’s says ‘in-de-pen-dent’. Plain T^X generally follows Webster 
except in a few cases where other authorities seem preferable. 

[From here to the end of this appendix, TgX will be typesetting with 

\hyphenpenalty=-1000 \pretolerance=-l \tolerance=1000 
\doublehyphendemer it s=- 100000 \f inalhyphendemer it s=- 100000 

so that hyphens will be inserted much more often than usual.] 
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The fact that plain T^X finds only 90% of the permissible hyphen 
points in a large dictionary is, of course, no cause for alarm. When word fre- 
quency is taken into account, the probability rises to well over 95%. Since 
Tf^X’s line-breaking algorithm often succeeds in finding a way to break a para- 
graph without needing hyphens at all, and since there’s a good chance of find- 
ing a different hyphen point near to one that is missed by TgX’s patterns, it is 
clear that manual intervention to correct or insert hyphenations in TgX output is 
rarely needed, and that such refinements take a negligible amount of time com- 
pared to the normal work of keyboarding and proofreading. 

But you can always insert words into T^X’s exception dictionary, if 
you find that the patterns aren’t quite right for your application. For exam- 
ple, this book was typeset with three exceptional words added: The format in Ap- 
pendix E includes the command 

\hyphenation{man-u-script man-u-scripts ap-pen-dix} 

which tells T|;X how to hyphenate the words ‘manuscript’, ‘manuscripts’, and 
‘appendix’. Notice that both singular and plural forms of ‘manuscript’ were en- 
tered, since the exception dictionary affects hyphenation only when a word agrees 
completely with an exceptional entry. (Precise rules for the \hyphenation com- 
mand are discussed below.) 

If you want to see all of the hyphens that plain Tp^I will find in some ran- 
dom text, you can say ‘\showhyphens{ (random text)}’ and the results will ap- 
pear on your terminal (and in the log file). For example, 

*\showhyphens {random manuscript manuscripts appendix} 

Underfull \hbox (badness 10000) detected at line 0 

[] \tenrm ran-dom manuscript manuscripts ap-pendix 

shows the hyphen positions that would have been found in this book with- 
out the addition of any \hyphenation exceptions. Somehow the word ‘man- 
uscript’ slips through all of the ordinary patterns; the author added it as an ex- 
ception for this particular job because he used it 80 times (not counting its ap- 
pearances in this appendix). 

The \showhyphens macro creates an libox that is intentionally underfull, 
so you should ignore the warning about ‘badness 10000’; this spurious message 
comes out because TpjX displays hyphens in compact form only when it is dis- 
playing the contents of anomalous hboxes. (TgX wizards may enjoy study- 
ing the way \showhyphens is defined in Appendix B.) 

f lf you want to add one or more words to the exception dictionary, just 
say \hyphenation{ (words)} where (words) consists of one or more (word) 
items separated by spaces. A (word) must consist entirely of letters and hy- 
phens; more precisely, a “hyphen” in this context is the token - 12 . A “let- 
ter” in this context is a character token whose category code is 11 or 12, or a con- 
trol sequence defined by \chardef, or \char(8-bit number), such that the correspond- 
ing character has a nonzero \lccode. TjgX uses the \lccode to convert each let- 
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ter to “lowercase” form; a word-to-be-hyplienated will match an entry in the excep- 
tion dictionary if and only if both words have the same lowercase form after conver- 
sion to lowercase. 

f TfrX will henceforth insert discretionary hyphens in the specified positions, 
whenever it attempts to hyphenate a word that matches an entry in the excep- 
tion dictionary, except that hyphens are never inserted after the very first letter or be- 
fore the last or second-last letter of a word. You must insert your own discretionary hy- 
phens if you want to allow them in such positions. A \hyphenation entry might con- 
tain no hyphens at all; then TjrX will insert no hyphens in the word. 

f Tlie exception dictionary is global; i.e., exceptions do not disappear at the end 
of a group. If you specify \hyphenation of the same word more than once, its 
most recently specified hyphen positions are used. 

/>)/>) The exception dictionary is dynamic, but the pattern dictionary is static: To 
JL JL change TjjX’s current set of hyphenation patterns, you must give an entirely 
new set, and TjgX will spend a little time putting them into a form that makes the hy- 
phenation algorithm efficient. The command format is \patterns{ (patterns)}, where 
(patterns) is a sequence of (pattern) items separated by spaces. This command is avail- 
able only in INITEX, not in production versions of IjgX, since the process of pattern com- 
pression requires extra memory that can be put to better use in a production sys- 
tem. INITEX massages the patterns and outputs a format file that production ver- 
sions can load at high speed. 

A (pattern) in the \patterns list has a more restricted form than a 
JL JL (word) in the \hyphenation list, since patterns are supposed to be pre- 
pared by experts who are paid well for their expertise. Each (pattern) con- 
sists of one or more occurrences of (value) (letter) , followed by (value). Here (value) is ei- 
ther a digit (0 12 to 9i2) or empty; an empty (value) stands for zero. For ex- 
ample, the pattern ‘ 0 aibo’ can be represented as OalbO or albO or Oalb or sim- 
ply alb. A (letter) is a character token of category 11 or 12 whose \lccode is 
nonzero. If you want to use a digit as a (letter), you must precede it by a nonempty 
(value); for example, if for some reason you want the pattern ‘laoii’ you can ob- 
tain it by typing ‘la012’, assuming that \lccode‘l is nonzero. Exception: The char- 
acter ‘ ’ is treated as if it were a (letter) of code 0 when it appears in a pat- 
tern. Code 0 (which obviously cannot match a nonzero \lccode) is used by T}?X to rep- 
resent the left or right edge of a word when it is being hyphenated. 

T^ a ' n T)?X inputs a file called hyphen.tex that sets up the pattern dictio- 
nary and the initial exception dictionary. The file has the form 


\patterns{ . ach4 . ad4der .aflt .al3t ••• zte4 4zlz2 z4zy} 
\hyphenation{as-so-ciate as-so-ciates dec-li-na-tion oblig-a-tory 
phil-an-thropic present presents project projects reci-procity 
re-cog-ni-zance ref-or-ma-tion ret-ri-bu-tion ta-ble} 

The first thirteen exceptions keep TjgX from inserting incorrect hyphens; for ex- 
ample, ‘pro-ject’ and ‘pre-sent’ are words like ‘re-cord’, that cannot be hy- 
phenated without knowing the context. The other exception, ‘ta-ble’, is in- 
cluded just to meet the claim that plain TjrX fully hyphenates the 700 or so most com- 
mon words of English. 


lowercase 

group 

patterns 

INITEX 

digit 



454 Appendix H: Hyphenation 


/gK/gK But how does TgX decide what sequences of letters are “words” that should 
JL JL be hyphenated? Let’s recall that TjgX is working on a horizontal list that con- 
tains boxes, glue, rules, ligatures, kerns, discretionaries, marks, whatsits, etc., in addi- 
tion to simple characters; somehow it has to pick out things to hyphenate when it is un- 
able to find suitable breakpoints without hyphenation. The presence of punctua- 
tion marks before and/or after a word should not make a word unrecognizable or unhy- 
phenatable; neither should the presence of ligatures and kerns within a word. On the 
other hand, it is desirable to do hyphenation quickly, not spending too much time try- 
ing to handle unusual situations that might be hyphenatable but hard to recog- 
nize mechanically. 

/gL/gL TgX looks for potentially hyphenatable words by searching ahead from each 
JL JL glue item that is not in a math formula. The search bypasses charac- 
ters whose \lccode is zero, or ligatures that begin with such characters; it also by- 
passes whatsits and implicit kern items, i.e. , kerns that were inserted by T^X it- 
self because of information stored with the font. If the search finds a charac- 
ter with nonzero \lccode, or if it finds a ligature that begins with such a charac- 
ter, that character is called the starting letter. But if any other type of item oc- 
curs before a suitable starting letter is found, hyphenation is abandoned (until af- 
ter the next glue item). Thus, a box or rule or mark, or a kern that was explicitly in- 
serted by \kern or \/, must not intervene between glue and a hyphenatable word. If 
the starting letter is not lowercase (i.e., if it doesn’t equal its own \lccode), hyphen- 
ation is abandoned unless \uchyph is positive. 

/gL/gL If a suitable starting letter is found, let it be in font /. Hyphenation is aban- 
JL JL doned unless the \hyphenchar of / is between 0 and 255, and unless a charac- 
ter of that number exists in the font. If this test is passed, TJlX continues to scan for- 
ward until coming to something that’s not one of the following three “admissi- 
ble items”: (1) a character in font / whose \lccode is nonzero; (2) a ligature formed en- 
tirely from characters of type (1); (3) an implicit kern. The first inadmissible item ter- 
minates this part of the process; the trial word consists of all the letters found in ad- 
missible items. Notice that all of these letters are in font /. 

/gL/gL If a trial word h . . A n lias been found by this process, hyphenation will still 
JL JL be abandoned unless n > A + p, where A = max(l, \lefthyphenmin) and 
p = max(l, \righthyphenmin). (Plain TgX takes A = 2 and p = 3.) Further- 
more, the items immediately following the trial word must consist of zero or more char- 
acters, ligatures, and implicit kerns, followed immediately by either glue or an ex- 
plicit kern or a penalty item or a whatsit or an item of vertical mode ma- 
terial from \mark, \insert, or \vadjust. Thus, a box or rule or math for- 
mula or discretionary following too closely upon the trial word will inhibit hyphen- 
ation. (Since TgX inserts empty discretionaries after explicit hyphens, these rules im- 
ply that already-hyphenated compound words will not be further hyphenated by the al- 
gorithm. ) 

/gL/gL Trial words h . . . l n that pass all these tests are submitted to the hy- 
JL JL plrenation algorithm described earlier. Hyphens are not inserted before 
l\ or after /„+ i_ p . If other hyphenation points are found, one or more discre- 
tionary items are inserted in the word; ligatures and implicit kerns are reconsti- 
tuted at the same time. 


ligatures 

kerns 

implicit kern 

kern 

uchyph 

hyphenchar 

lefthyphenmin 

righthyphenmin 

mark 

insert 

vadjust 

explicit hyphens 



Appendix H: Hyphenation 


455 


/>)/>) Since ligatures and kerns are treated in quite a general manner, it’s pos- 
JL JL sible that one hyphenation point might preclude another because the lig- 
atures that occur with hyphenation might overlap the ligatures that occur with- 
out hyphenation. This anomaly probably won’t occur in real-life situations; there- 
fore T)gX’s interesting approach to the problem will not be discussed here. 

/>)/>) According to the rules above, there’s an important distinction between im- 
JL JL plicit and explicit kerns, because TgX recomputes implicit kerns when 
it finds at least one hyphen point in a word. You can see the difference be- 
tween these two types of kerns when TgX displays lists of items in its internal for- 
mat, if you look closely: ‘\kern2 . 0’ denotes an implicit kern of 2 pt, and ‘\kern 2 . 0’ de- 
notes an explicit kern of the same magnitude. The italic correction command \/ in- 
serts an explicit kern. 

/>)/>) The control sequence \- is equivalent to \discretionary{\char 
JL JL where h is the \hyphenchar of the current font, provided that h lies be- 
tween 0 and 255. Otherwise \- is equivalent to \discretionary{}{}{}. 

So far we have assumed that T^X knows only one style of hyphenation at a 
JL JL time; but in fact 1)?X can remember up to 256 distinct sets of rules, if you have 
enough memory in your computer. An integer parameter called \language selects the 
rules actually used; every \hyphenation and \patterns specification appends new rules 
to those previously given for the current value of \language. (If \language is nega- 
tive or greater than 255, TpX acts as if \language = 0.) All \patterns for all lan- 
guages must be given before a paragraph is typeset, if INITEX is used for typesetting. 

TjrX is able to work with several languages in the same paragraph, be- 
JL JL cause it operates as follows. At the beginning of a paragraph the “cur- 
rent language” is defined to be 0. Whenever a character is added to the cur- 
rent paragraph (i.e. , in unrestricted horizontal mode), the current language is com- 
pared to \language; if they differ, the current language is reset and a whatsit node spec- 
ifying the new current language is inserted before the character. Thus, if you say 
‘\def\french{\languagel . . . }’ and ‘mix {\french franc/ais} with English’, T^X 
will put wliatsits before the f and the w; hence it will use language 1 rules when hy- 
phenating franc/ais, after which it will revert to language 0. You can insert the what- 
sit yourself (even in restricted horizontal mode) by saying \setlanguage(number); 
this changes the current language but it does not change \language. Each what- 
sit records the current \lefthyphenmin and \righthyphenmin. 


internal format 
italic correction 

hyphenchar 

language 

hyphenation 

patterns 

setlanguage 

French 

JUSTUS 

TIMES TRIBUNE 


If all problems of hyphenation have not been solved, 
at least some progress has been made since that night, 
when according to legend, an RCA Marketing Manager received 
a phone call from a disturbed customer. His 301 had Just hyphenated "God." 
— PAUL E. JUSTUS, There's More to Typesetting Than Setting Type (1972) 

The committee skeptically re- 
commended more study for a bill 
to require warning labels on rec- 
ords with subliminal messages re- 
corded backward. 

— THE PENINSULA TIMES TRIBUNE (April 28, 1982) 



Index 
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The author has tried to provide as complete an index as possible, so that people 
will be able to find things that are tucked away in obscure corners of this long 
book. Therefore the index itself is rather long. A short summary of the simpler 
aspects of TgX appears at the beginning of Appendix B; a summary of special 
symbols appears at the end of Appendix F; a summary of other special things 
appears under ‘tables’ below. 

Page numbers are underlined in the index when they represent the definition 
or the main source of information about whatever is being indexed. (Underlined entries 
are the most definitive, but not necessarily the easiest for a beginner to understand.) 
A page number is given in italics (e.g., ‘ 123 ’) when that page contains an instructive 
example of how the concept in question might be used. Sometimes both underlining 
and italics are appropriate. When an index entry refers to a page containing a relevant 
exercise, the answer to that exercise (in Appendix A) might divulge further information; 
an answer page is not indexed here unless it refers to a topic that isn’t included in the 
statement of the relevant exercise. 

Control sequence names that are preceded by an asterisk (*) in this index are 
primitives of T^X; i.e., they are built in. It may be dangerous to redefine them. 


u (visible space), 3, 420, 429; 
see also (space) , spaces. 

*\ u (control space), 8 , 10, 19, 73, 74, 

86-87, 154, 163 , 167, 283, 285, 

290 , 323, 351, 381. 

it (hash mark), 38, 51, 113, 200-202, 203 , 
204-205, 228, 235, 236-240. 

\# ( # ), 38, 51, 356- 

##, 203-205, 228, 359-362, 378-379. 

###, 88 . 

#{, 204, 401. 

$ (dollar sign), 4, 38, 51, 54, 86-88, 92, 127, 
134-135, 185-186, 269, 283, 287, 293. 

\$ ( $ ), 38, 51, 202, 309, 356- 

$$, 86-89, 185, 186-196, 232, 287, 293, 
375-376, 421. 

7. (percent sign), 26, 38, 39, 43, 48, 51, 113, 
124 , 249, 337, 340, 343. 

\7 ( % ), 38, 43-44, 51, 356. 

77, 112-113. 

k (ampersand), 38, 51, 175-177, 190-196, 
231-2.{8, 282, 385-386. 
for preloaded formats, 25, 26, 344. 

\fc ( & ), 38, 51, 53, 356. 

kk, 241-242, 361, 412. 

’ (apostrophe or right quote), 3-5, 51, 130, 
155, 201, 305, 324, 357, 394-395; 
see also octal. 

V (acute accent), 7-9, 52-53, 305, 

335, 356, 420. 

’ ’ ( ” ), 3-5, 24, 394-395. 


' (reverse apostrophe or left quote), 3-5, 51, 
132, 134, 305, 391, 394-395; 
see also alphabetic constant. 

V (grave accent), 8, 52-53, 305, 356 . 

“ ( “ ), 3-5, 24, 394-395. 

" (double quote or ditto mark), 52, 53, 

134; see also hexadecimal. 

\" (dieresis or umlaut accent), 7, 9, 24, 

25, 52-53, 55, 356 . 

( (left parenthesis), 51, 134, 140, 

145-150, 345. 

\(, 409. 

) (right parenthesis), 51, 134, 140, 

145-150, 345. 

[ (left bracket), 51, 134, I46-I48, 

171, 408, 437. 

[] , 28, 79, 302. 

[1], 23, U9. 

] (right bracket), 51, 134, 146-147, 

171, 345, 408, 437. 

{ (left brace), 13-l.f, 19-21, 38, 51, 

200-202, 203-204. 205-206, 216, 269, 
275-276, 283, 286, 291, 330. 

\{ ( { ), 134, 146-147, 174-175, 361- 

O, 19, 54, 82, 95, 114, 129, 130, 150, 

169, 196, 242, 253, 262, 305, 315, 

318, 351, 393. 

> (right brace), 13-14, 19-21, 38, 51, 
200-202, 203-204, 205-206, 269, 
275-276, 279, 301, 330. 

\> ( } ), 134, 146-147, 174-175, 361. 
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+ (plus sign), 51, 132, 268. 

\+ (begin tabbed line), 231-234 , 249, 

339, 354 . 

- (hyphen or minus), 4, 51, 93, 95, 

127, 132, 268. 

*\- (discretionary hyphen), 95, 283, 

287, 292, 455. 

— , , see en-dash, em-dash. 

±, see \pm. 
see \mp. 

* (asterisk), 23, 25, 51, 99, 113, 116, 

132-133 , 154, 326. 

\* (discretionary x), 173, 357 . 

**, 23, 25, 344. 

/ (slash), 51, 132, 146-147, 320, 430. 

*\/ (italic correction), 14, 64 , 287 , 292 , 

306, 382, 455. 

I (vertical line), 52, 53, 132, I 46 -I 47 , 

171, 174, 438. 

\l ( || ), 146-147, 171, 361, 435, 438. 

\ (backslash), 7, 38, 39, 40, 51, 146-147, 

343 , 436. 

\\, 38, 378, 418. 

< (less than sign), 52, 53, 133, 150, 154, 209. 
<, 45, 135, 369; see also \le. 

= (equals sign), 51, 133, 209, 226, 275, 376. 
\= (macron accent), 52, 53, 356 . 

d = , 181, 437. 

45, 135, 369; see also \ne. 

> (greater than sign), 52, 53, 133, 150, 209. 
\> (medium space), 167, 171, 357 . 

>, 45, 135, 369; see also \ge. 

(), see angle brackets. 

, (comma), 51, 72-73, 134, 161-162, 
172-174 , 394-395, 430. 

\, (thin space), 5, 167-173, 305, 357 , 
409-410. 

. (period), 51, 72-73, 133-134, 149, 161, 
345 , 394 -395, 430. 
space after, 73-75 ; 76. 

\. (dot accent), 52, 356 . 

. . . , see \ldots, ellipses. 

• • • , see \cdots, ellipses. 

: , see \vdots, ellipses. 

; (semicolon), 51, 134, 161. 

\; (thick space), 167, 171, 357 . 

: (colon), 51, 133-134, 155, 161, 174, 438. 
:=, 133. 

? (question mark), 31, 51, 73, 161. 

I (open question), 51. 

! (exclamation point), 51, 72, 73, 75, 169. 

; (open exclamation), 51. 

\! (negative thin space), 167 , 169, 357 . 


_ (underscore), 38, 51, 128-130, 134. 

\_ ( . ), 38, 165, 356. 

~ (hat), 38, 51, 128-130, 134, 369, 423. 

\~ (circumflex accent), 52-53, 356 . 

", 45, 47, 48, 368, 370, 423. 

"M (ASCII (return)), 45, 249, 331, 343, 345, 
348, 352, 380, 390-391, 421, 423. 

\“M, 8, 305, 351 . 

(tilde), 38, 51, 343, 353 ; see also ties. 

\~ (tilde accent), 52, 356 , 387. 

U, 135, 343, 368, 429; 

see also \uparrow, \dounarrou. 

® (at sign), 51, 98-99, 132, 134, 344, 

364 , 4 08, 414. 
a®, 98-99. 

\ane, 345 . 

\aa ( a ), 356 . 

\AA ( A ), 356 . 

abbreviations, 73-74, 340', see also macros. 
*\above (general fraction), 143, 152, 

292 , 444-445. 

*\abovedisplayshortskip, 189 , 274, 

348, 4 1 5. 

*\abovedisplayskip, 189 , 190 . 194, 274, 

291, 348, 4 1 5. 

*\abovewithdelims, 152, 292 , 444—445. 
absolute value, 146 , 149, 171, 175. 

Acc atom, 158, 289, 443. 

*\accent (general accent), 9, 54, 86, 283, 286 . 
accents (' etc.), 7, 52-53, 339, 356 , 

357, 427-429. 
as ligatures, 46, 54. 
in math, 135-137, 141, 164-165, 

359 . 435, 443 . 
on top of accents, 136. 
table, 52, 135, 339. 

\active (category 13), 241, 343 , 395, 421- 
active character, 37, 40, 209, 241, 307, 

377, 380-381, 394-395. 
active math character, 155, 289 . 
active spaces, 381, 394, 421. 

\acute (math accent: x), 135, 359 . 
acute accent ('), see V, \acute. 

\address, 4 03-404 , 407 . 

*\adjdemerits, 98, 273, 314, 348. 

*\advance, 21, 118-119, 218, 256, 276, 355. 
\advancepageno, 256, 257, 362 , .(16. 

\ae ( as ), vii, 17, 45-46, 52-53, 239, 356- 
\AE ( M ), 52-53, 356 . 

*\afterassignment, 215, 279 . 352, 

364 , 376, 401. 

*\aftergroup, 215, 279, 363 , 374, 377, 379. 
‘ain, see reverse apostrophe. 
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al-Khwarizmi, abu Ja‘far Muhammad 
ibn Musa, 53. 

\aleph ( N ), 9, 358 , 435. 

Alice, 4, 387, 394. 

alignment displays, 190, 193, 291 . 

(alignment material), 282, 285. 
alignments, 231-249, 282, 302-303, 385-386, 
392; see also tabbing. 

Alka-Seltzer, 404-405. 
all caps, see \ uppercase. 

Allen, Todd Andrew, 377. 
allocation, 121-122, 346 , 347 . 
\allocationnumber, 346. 

\alloubreak, 1 74, 353 . 396. 

\allowhyphens, 394, 395 . 

\alpha ( a ), 127, 201, 358, 434. 

\Alpha, 434. 

alphabetic constants, 44~46, 215, 269 , 

270, 309, 367, 385. 
alternatives, see \cases. 

\amalg ( II ), 358, 436. 

American Mathematical Society, ii, vii. 
ampersand, 25, 38, 51, 175-177, 190-196, 
231-2.{8, 282, 344, 385-386, 428. 
AmS-TeX, 164, 420, 434. 
anatomy of TgX, 38-39, 46, 85, 267 , 349, 
373, 379, 385, 386, 456 . 

\angle ( Z ), 358 , 435. 
angle brackets ( () ), 59, 146-147, 150, 268 , 
420, 437; see also Mangle, \rangle. 
angstrom unit, see \AA. 

\annotations, 403, 404, 407 . 
answers to the exercises, 305-337. 
Antisthenes of Athens, 239. 
apostrophe, 3-5, 51, 130, 155, 201, 324; 

see also octal. 

\approx ( ~ ), 128, 436. 

Arabic, 66. 

\arccos ( arccos ), 162, 361 . 

Archytas of Taras, 239. 

\arcsin ( arcsin ), 162, 361. 

\arctan ( arctan ), 162, 361. 

Arenskii, Anton Stepanovich, 410. 

\arg ( arg ), 162, 361. 

arguments, 33, 200-205, 263, 268, 375-380. 

Aristippus of Gyrene, 239. 

Aristophanes, 239. 

Aristotle, 35. 

arithmetic, 117-119, see \advance, 
Multiply, \divide. 

(arithmetic), 276 . 

arrays, 176-178, see matrices. 

arrows, 1 46-1 47, 182, 226, 437 . 


\arrowvert ( | ), 150, 359 . 

\Arrowvert ( || ), 150, 359 . 

The Art of Computer Programming, 
259-260. 

as is, see \obeylines, \obeyspaces, 
verbatim. 

ASCII, 3, 43-45, 49, 214, 343, 36L 371. 
(assignment), 275 . 
assignments, 275-278, 373. 

\ast ( * ), 436. 

asterisk, 23, 25, 51, 99, 113, 116, 132-133, 
154, 326. 

\asymp ( X ), 436. 

at, 16-17, 60, 213, 277, 408, 414, 433. 

(at clause), 277 . 

at sign, 51, 98-99, 132, 134, 344 , 364, 

40 8, 414. 

AT&T, 247. 

atoms, 157-159, 170-171, 289-290, 441-447. 
table of atomic types, 158. 

*\atop, 143, 145 , 152, 178, 292 , 444 . 
*\atopwithdelims, 152, 292 , 324, 360, 444. 
author, typesetting by, 182, 412—413. 
auxiliary spaces, see ties, 
axis line, 150-152, 179, 443-447. 

\b (bar-under accent), 52, 356 . 

Bach, Johann Sebastian, 408. 

Bach, P. D. Q., 410-411, 481. 
backslash, 7, 38, 39, 40, 51, 146-147, 

343, 436. 

\backslash ( \ ), 38, 146-147, 359 , 435, 436. 
backspacing, 66, 82-83, 222, 394-395, 418. 
Backus, John Warner, 268. 

Bacon, Francis, viscount St. Albans, 41. 

Bacon, Leonard, 1. 

bad breaks, avoiding, 27-30, 91-94, 

173-174, 197. 

badness, 28-30, 97-99, 111-113, 302. 
*\badness, 214, 229, 271. 

(balanced text), 275, 276 , 385. 
balancing columns, 386-388, 396-397, 417. 
\bar (math accent: ;r), 135, 136. 
bar accent (~), see \=, \bar. 
bar-under accent (_), see \b. 

Barrett, Percy Reginald, 197. 

Barrough, Philip, 229. 
baseline, 15, 63, 77, 80-81, 150. 
*\baselineskip (normal vertical distance 
between baselines), 78-79, 80, 104, 

194, 253, 256, 274, 281, 342, 349, 
351-352, 409, 414-415. 

*\batchmode, 32, 277, 299, 336. 

Batey, Charles, 197. 
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beauty, 1. 

Beck, Simone, 233, 236. 

Beethoven, Ludwig van, 408, 410-411. 
Beeton, Barbara Ann Neuhaus Friend 
Smith, 483. 

\beginchapter, 418. 

*\begingroup, 21, 249, 262 , 279 . 380, 

407, 419. 

\beginsection, 340-34 1 355 . 
*\beloudisplayshortskip, 189 . 274, 

348, 4 1 5. 

*\belowdisplayskip, 189 , 190 , 194, 274, 
291, 348, 4 1 5. 

Berner, Robert William, see TEX, ASCII, 
bent bars, see angle brackets. 

Bertholle, Louisette, 233, 236. 

\beta (/3 ), 127, 434. 

\Beta, 434. 

\bf (use boldface type), 13-14, 164-165, 
328, 351, 409, 414-415. 

\bffam, 351 . 

\bgroup (implicit {), 269, 351 , 363, 

382, 385, 407, 421. 

Bibby, Duane Robert, i. 

Biblical references, 303, 311. 
bibliographies, 4, 7.(, 93, 340-341- 
\big (largish delimiter), 147, 171, 320, 

359, 360, 414-415. 

\Big (between \big and \bigg), 147, 

175, 359, 360 . 

big-O notation, 132, 161-162, 169. 
big point, 57, see bp. 

\bigbreak, 111, 116, 353 . 363. 

\bigcap (large fl), 147, 435. 

\bigcirc ( Q ), 436. 

\bigcup (large U), 147, 435. 

\bigg (large delimiter), 147, 175, 196, 

327, 359, 360- 

\Bigg (larger than \bigg), 147, 175, 

359, 360. 

\bigggl, \bigggr, 324. 

\biggl (\bigg left delimiter), 147, 

149, 359, 437. 

\Biggl (\Bigg left delimiter), 147, 

149, 359, 437. 

\biggm (\bigg middle delimiter), 147, 359 . 
\Biggm (\Bigg middle delimiter), 147, 359 . 
\biggr (\bigg right delimiter), 147, 

149, 359- 

\Biggr (\Bigg right delimiter), 147, 

149, 359 . 

\bigl (\big left delimiter), 146-147, 
149-150, 155, 171, 175, 359, 437. 


\Bigl (\Big left delimiter), 147, 359 , 437. 
\bigm (\big middle delimiter), 147, 

359 , 171, 175. 

\Bigm (\Big middle delimiter), 147, 359 . 
\bigodot (large 0), 435. 

\bigoplus (large 0 ), 435. 

\bigotimes (large 0 ), 435. 

\bigr (\big right delimiter), 146-1.(7, 
149-150, 171, 175, 359- 
\Bigr (\Big right delimiter), 147, 359 . 
\bigskip, 70, 109, 111, 115-116, 352, 

355, 407, 410-412. 

\bigskipamount, 123, 349 , 352-353, 363. 
\bigsqcup (large U), 435. 

\bigtriangledown ( V )> 436. 
\bigtriangleup ( A ), 436. 

\bigtype, 408-409, 411. 

\biguplus (large 0), 358 , 435. 

\bigvee (large V), 435. 

\bigwedge (large A), 435. 

Bill, 387, 394. 

Bin atom, 158, 170-171, 289, 442-444, 446. 
binary operations, 132-133, 154-155, 164, 
196, 358, 435, 436 ; see also Bin atom, 
binary search, 387-388. 
binomial coefficient, 143, see \choose. 
*\binoppenalty, 101, 174, 272, 322 , 348, 446. 
black box, 64, 221, 222 . 
blackboard bold (e.g., Hi), 164, 434. 
blank line in input file, 24, 37, 47 , 

340-341, 381. 
blank space, see spaces. 

Blase Bohning, Maria Dorothea, 248. 
block structure, see grouping, 
block style, 405-407. 

\bmit (boldface math italic), 156. 

\bmod ( mod ), 164, 322, 361 . 

\body, 403-404, 407 . 

Boehm, Peter James, 159. 

Bohning, Jobst Heinrich, 248. 

Bohning, Martin John Henry, 248. 

Bohning Knuth, Louise Marie, 248. 
boldface, 13, 156, 164-165, 386. 
book design, 412. 
book preparation, 303, 425. 

\bordermatrix, 177, 361 . 

\bot ( _L ), 435. 

*\botmark, 213, 258, 259-260, 262-263, 280. 
boundary item, 157, 442, 446. 

Bourbaki, Nicolas, 106. 

\bowtie ( 1 x 1 ), 358 , 436. 

(box), 120, 222, 278, 282, 285, 290. 
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*\box (use box register), 120-122, 151, 222, 
278 , 346, 354 , 386 , 387. 

\box255, 125, 253-258. 

(box dimension), 271 . 277. 

box displays, 66, 75, 79, 158-159, 302, 455. 

box memory, 300, 394. 

(box or rule), 281 . 

(box size assignment), 277 . 

(box specification), 222, 278 . 

|boxed| control sequence names, 38. 
boxed material, 223 , .(20. 
boxes, 63-67, 77-83, 221-229. 
*\boxmaxdepth, fU, 113, 249 , 255 , 274 , 348. 
bp (big point), 57, 270. 

\brace (notation like {",}), 360 . 

\braceld, \bracelu, \bracerd, \braceru 
(pieces of horizontal braces), 357 . 
braces, 51, 216, 269, 275-276, 279, 283, 286, 
289-291, 330, 345, 385-386. 
for arguments to macros, 20, 200-202, 
203 , 204 . 205-206, 385-386. 
for grouping, 13-14, 19-21, 232, 248, 253. 
horizontal, 176, 225-226, 339. 
implicit, 269 . see \bgroup, \egroup. 
in math formulas, 145-147, 174-176. 
\bracevert ( | ), 150, 359 . 

\brack (notation like [))]), 360 . 
brackets, 51, 134, I 46 -I 48 , 171, 408, 437. 
\break (force line or page break), 94, 

97, 106, 114, 193, 353 . 
breakpoints, 96, 97-100, 110 , 111-114, 394. 
avoiding bad, 27-30, 91-94, 109 III. 
173-174, 197. 

discretionary, 95-96, 173, 287, 292, 357. 
forcing good, 94, 105, 109-111, 114. 
in displays, 195-197. 
in formulas, 173-174, 446-447. 

\breve (math accent: x), 135. 
breve accent (”), see \u, \breve. 

British pound sign, 54. 
brochures, 251. 

*\brokenpenalty, 104 , 105, 272, 348. 

Brooks, Frederick Phillips, Jr., 365. 

Brown, Peter John, 425. 

\buildrel, 361 . 437. 

built-up (extensible) characters, 442. 

built-up fractions, see \over. 

\bull ( ■ ), 420 . 

Bull, John, 239. 

\bullet ( • ), 133, 154, 355, 436. 
bulleted lists, 102, 105. 
business correspondence, 200, 403-408. 
by, 118, 276. 


\bye, 87-88 , 340, 357 . 

Byron, George Gordon Noel Byron, 
baron, vii. 

\c (cedilla accent), 24-25, 52, 356 . 

\cal (calligraphic caps), 164, 351 , 431, 434. 
calculus, 168-169, 180-181. 
camera alignment, 416-417. 

\cap ( fl ), 133, 436. 

capacity of Tp)X, 100, 300-301, 383. 

caps and small caps, 203. 

captions, 115. 

caret, 369. 

caron, see hacek. 

carriage return, 231, see (return), \cr. 
\cases ( { ' ' ), 175, 362 . 

Caswell, Herbert Ernest, 413. 

*\catcode, 39, 134, 214 , 271, 305 , 343, 
380-382, 384, 390-391, .{21, .{2.{. 
category codes, 37-40, 47, 48, 203-205, 
209-210, 214, 381. 
table, 37. 

cc (cicero), 57, 270. 

\cdot ( • ), 133, 172, 319, 436. 

\cdotp, 358, 359 . 438. 

\cdots (•••), 172, 176, 180-181, 359, 438. 
cedilla accent Q, 25, 52, 54, see \c. 
ceiling brackets ( ["] ), 146-147, see \lceil, 
\rceil. 

centering, 71, 233, 236. 

\centering, 347 . 348, 362. 

\centerline (make a centered line), 20, 24 , 
33, 71, 85, 101, 117, 232, 311, 340, 353 . 
\cents ( <4 ), 140, 319 . 

Cesaro, Ernesto, 53. 

Chalkovskii, Petr Il’ich, 410-411. 

*\char, 43-46, 76, 86, 155, 283, 286, 

289, 340, .{27, 452. 

(character) , 289 . 

character codes, 43-46, 367-370; 

see also category codes. 

(character token) , 270 . 

*\chardef, 44, 121, 155, 210, 214, 215, 272, 
277 . 336 , 343, 345, 452. 

(chardef token), 271 , 272. 

Charles XII of Sweden, 92. 

Chaundy, Theodore William, 197. 

\check (math accent: i), 135. 
check accent ('), see \v, \check. 
chemical typesetting, 179. 

Cherry, Lorinda Landgraf, 159. 

\chi ( X ), 1, 434. 

Chicago, University of, Press, 125, 

293, 418, 439. 
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Child, Julia, 233, 236. 

Children’s Television Workshop, ii. 
choice, four-way, 157, 292 . 442 . 

\choose (notation like (£)), 139, 143, 

152, 178, 360. 

Christie Mallowan, Dame Agatha Mary 
Clarissa (Miller), 249. 
cicero, see cc. 

\circ ( o ), 133, 323, 436. 
circles, see \circ, \bigcirc. 
circular quotation, 101. 
circumflex, 369, see hat. 
circumflex accent ('), 52, 356 , see \~. 
classes of math characters, table, 154. 
*\cleaders, 224, 225-226, 357, 371 
\cleartabs, 234, 354 . 

Close atom, 158, 170-171, 289, 443-444, 446. 
*\closein, 217, 280. 

*\closeout, 226-228, 254, 280, 422. 

\closing, 4 03-404 , 407 . 

closings, 134, 147, 154-155, 359, 437; 

see also Close atom, 
club lines, 104, 272, 398. 

*\clubpenalty, 104, 113, 272, 317, 348, 419. 
\clubsuit ( A ), 435. 
cm (centimeter), 24, 57, 270. 
cmbx fonts, 60, 350, 413, 428, 433. 
cmex fonts, 157, 225, 350, 432-433. 
cmmi fonts, 350-351, 413-414, 430, 433. 
cmr fonts, 16-17, 60, 63-64, 76, 350, 

413, 427, 433. 

cmsl fonts, 63-64, 350, 413, 428, 433. 
cmsy fonts, 157, 350-351, 413-414, 431. 
cmti fonts, 350, 413, 428, 433. 
cmtt fonts, 60, 350, 413-414, 429, 433. 

(code assignment), 277 . 

(codename), 271 . 

codes for characters, 43-46, 367-370. 

Coelho, Manuel Rodrigues, 239. 
coerce (dimen) to (number), 270. 
coerce (glue) to (dimen), 270. 
coerce (number) to (dimen), 336. 

(coerced dimen), 270 . 

(coerced integer), 269 . 

(coerced mudimen), 270 . 

collective signs, see large operators. 

colon, 51, 133-134, 155, 161, 174, 438. 

\colon ( : ), 134, 359 . 4 38. 

color, 229. 

column vector, 177. 

column width, 29, 231, 257, 387, 417. 

\columns, 231, 354. 


comma, 51, 72-73, 134, 161-162, 172-174, 
394 -395, 430. 
commands, 267-293. 
comments, 26 , 47, 337, 340. 
communication between macros, 211, 
328-329, 375-376, 407-408. 
commutative diagram, 182. 
composing stick, 64-65. 
compound fraction, L(3. 
compound matrix, 181. 

Computer Modern fonts, 16, 350, 427-438. 
computer programs, 38, 165, 234- 
concert programs, 408-412. 
conditionals, 206-208, 209 . 210 . 211, 

240, 308, 384. 

\cong ( S ), 151, 360, 436. 
constants, 269 . 270, 308. 
continued fractions, 142 . 

(control sequence) , 275 , 277. 
control sequences, 7-11, 46, 199, 457. 
misspelled, 31-32. 

control space (\u), 8, 10, 19, 73, 74, 86-87, 
154, 163, 167, 283, 285, 290, 351, 381. 
control symbols, 7-8, 46-47. 
control words, 7-8, 38, 46-47, 204. 
coordinates, 389. 

\coprod (large II), 435. 

*\copy (copy a box), 120, 151, 222 , 278, 

329, 37.{, 386, 407. 

\copyright ( © ), ii, 308, 339, 356 . 

Cornet, Peeter, 239. 

Correa de Arauxo, Francisco, 239. 
correspondence, 200, 403-408. 

\cos ( cos ), 162, 361. 

\cosh ( cosh ), 162, 361. 

cost of a page break, 111-113, 124. 

\cot ( cot ), 162, 361. 

\coth ( coth ), 162, 361. 

*\count registers, 118-122, 207-208, 271 , 

276 , 346-347, 379. 

\count0, 119, 207, 252-254, 362. 

*\countdef, 119, 121, 210, 215, 271, 

277 , 346-347. 

(countdef token) , 271 . 

Cowper, William, 35. 

*\cr (end of aligned row), 175-177, 190-197, 
231-238, 245, 248, 275, 282, 351, 352, 
385-386, 412, 418, 421. 
avoiding, 249. 

cramped styles, 140-141, 445. 

*\crcr (force \cr), 249, 275, 282, 361-362, 
385, 412, 421. 

Crocker, Betty, 425. 
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cross, see \dag ( f ), \times ( X ). 
crotchets, see brackets. 

\csc ( esc ), 162, 361. 

*\csname, 40-41, 213, 3.(8 , 375. 
\csname\endcsname, 46, 308. 
cube root, 130-131. 

Cummings, Edward Estlin, 49. 

\cup ( U ), 133, 436. 

curly braces, see braces. 

current font, 13, 20, 154, 163, 213-214. 

current page, 112, 122-125, 278, 280. 

cyclic preambles, 241 , 242, 246, 361 , ^12. 

Cyrillic characters, 370. 

\d (dot-under accent), 52-53, 356 . 

\dag ( f ), 53, 117, 356, 438-439. 

\dagger (f as binary operator), 436, 439. 
Dale, Robert William, 283. 

\danger, 419. 

dangerous bend, v-vi, 5, 15, 44-45, 70, 419. 
Danish characters, 45-46, 52-53, 370. 

\dash, 30. 

dashes, 4, 26, 30, 51, 93, 95. 

\dashv ( H ), 436. 
date, today’s, 406. 

*\day, 273, 349, 406. 
dd (didot point), 57, 270, 272. 

\ddag ( | ), 117, 356, 438-439. 

\ddagger (} as binary operator), 436, 439. 
\ddot (math accent: x), 135, 359 . 

\ddots ( '■ ), 177, 359 . 

\ddt (debugging aid), 248. 

De Vinne, Theodore Low, 107. 
*\deadcycles, 214, 255, 264, 271, 283, 401. 
debugging, 205, 248, 298-303, 347. 
decent lines, 97, 99. 

(decimal constant), 270 . 
decimal points, 57, 134, 240. 

(def), 275. 

*\def, 44, 136, 199-208, 215, 275-276. 
default output routine, 253-255. 
default rule thickness, 443-447. 
default values of parameters, 348-349. 

*\def aulthyphenchar, 273, 348. 

*\def aultskeuchar, 273, 348. 
defining a control sequence, 199-208. 
(definition), 275 . 

(definition text), 275 . 

\deg ( deg ), 162, 361. 
degrees ( ° ), 180. 

*\delcode, 156, 214, 271, 290, 345. 

(delete) (ASCII code 127), 37, 39, 343, 369. 
deleting tokens, 32, 215, 295-297. 

(delim), 289-290. 


delimited arguments and parameters, 
203-205, 249, 263, 375-377, 407. 
*\delimiter, 156 , 289-290, 359. 
*\delimiterf actor, 152, 273 , 348, 446. 
delimiters, 145-150, 156-157, 171, 

290 . 437, 442 . 

*\delimitershortf all, 152, 274, 3.(8, 446 . 
\delta ( 8 ), 127, 434. 

\Delta ( A ), 169, 186, 434. 
demerits, 94, 97-99, 273, 461- 
denominator, 141, 152, 179, 444-445. 
depth, 221, 224, 282, 337. 
depth of a box, 63-67, 77, 80-82, 225. 
Derek, Bo, 293. 
descenders, 63, 113, 319, 323. 
design size, 16-17, 213. 

\det ( det ), 162, 361 . 
device-independent output, 23; 

see also .dvi. 

Diabelli, Antonio, 408. 
diagnostic form of lists, 66, 75, 79, 

158-159, 302, 455. 
dialogs with the user, 217-218. 

\diamond ( o ), 436. 
diamond leaders, viii. 

\diamondsuit (<(>), 435. 

Diaz de la Pena, Maximiliano Antonio 
Temistocles, 384. 

Dick and Jane, 72-74, 76. 

Dictionaries, 259, 449-453. 

Didot, Frangois Ambroise, 57. 
didot point, 57, see dd. 
dieresis ("), 52, 53, 356, see \". 
diesis ( | ), see \ddag. 

Dieter, Ulrich Otto, 14. 

(digit), 269 . 

(digit string), 57. 
digits, 51, 132, 435, 453. 

width of, 60, 241, 427. 

\dim ( dim ), 162, 361. 

(dimen), 59, 61, 71, 118, 270, 271. 

(dimen parameter), 271, 274 . 276. 

*\dimen registers, 118-122, 271, 276 , 
346-347, 349, 360, 363, 395. 

(dimen variable), 276 . 

*\dimendef , 119 , 215, 277, 3.(6-347. 
(dimendef token) , 271 . 
dimensions, 57-61. 

as arguments, 204, 362. 

Dionysius I of Syracuse, 239. 
diphthongs, see \ae, \oe. 
discardable items, 95, 110-112, 124, 393. 
*\discretionary, 95-96, 283, 286, 287 , 292 . 
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discretionary hyphens, 28, 95-96, 453, 455 . 
discretionary multiplication signs, 173, 357 . 
display math mode, 85-89, 289-293. 
display style, 140-142, 441-447. 
Wdisplayindent, 188, 190, 274, 291, 349. 
Wdisplaylimits, 144, 159, 292 . 443. 
\displaylines, 194, 196 , 362 . 
displays, 87, 103, 139-1.(5 , 166-167, 
185-197, 232, 2.(1, 315. 
at beginning of paragraph, 316. 
non-centered, 186, 326, 375-376, 420-421. 
positioning of, 188-190 . 

*\displaystyle, 141-142, 292 , 362. 
*\displayuidowpenalty, 104, 272, 3.(8. 
*\displaywidth, 188, 190, 274, 349. 

Disraeli [Beaconsfield] , Benjamin, earl, 219. 
ditto mark, 53, 441. 

\div ( 4- ), 436. 

*\divide, 118-119, 218-219, 276, 391, 

397, 398, 417. 

\do, 344, 380, 423. 

dollar sign, 4, 38, 51, 54, 86-88, 92, 127, 
134-135, 185-186, 283, 287, 293 . 
Donnelley, Richard Robert, vii. 
\dospecials, 344, 380, 422—423. 

Ndosuperej ect, 256, 364 . 

\dot (math accent: x), 135. 
dot accent ('), see \ ., \dot. 
dot-under accent (.), see \d. 

\doteq ( = ), 361 , 436. 

\dotfill ( ), 244, 33.(, 335, 

340-341, 357 . 

dotless letters, 52-53, 136, 435. 

\dots ( . . . ), 173, 356 : see also ellipses, 
double-column format, 257, 386-388, ^17. 
double dagger, see \ddag, \ddagger. 
double dangerous-bend signs, vi, 419. 
double integrals, 169, 180. 
double quote mark, 52, 53, 134; 
see also hexadecimal. 

*\doublehyphendemerits, 98 , 273, 348, .(51. 
\downarro v ( j. ), 14 6-147, 182, 359, 437. 
\Downarrow ( JJ. ), 146-1.(7, 359 , 437. 
\downbracef ill ( t A j 225—226, 

331, 357 . 

Doyle, Sir Arthur Conan, 401. 

*\dp, 120, 271, 316, 388-389, 417. 
dragon curve, 391. 

Drofnats, Revinu Jitis, 24, 27-28, 73-74, 
404-406, 408, 410-412. 

Dudeney, Henry Ernest, 249, 334. 

Dull, Brutus Cyclops, 131, 173, 260. 

\duimny, 401. 


*\dump, 283 , 286 , 336, 344 ■ 

Durant, William James, 239. 

.dvi, 23, 43, 60, 119, 213, 228, 254, 279, 
280, 302, 367, 374. 

Dvorak, Antonin Leopold, 409. 
dx, 168. 

EBCDIC, 43. 

*\edef, 215-216, 275, 328, 348, 373-374 ■ 
editing, 34, 139, 197. 
efficiency, 329, 333, 342, 345, 347, 383, 
384, 394, 400, 423. 

\egroup (implicit >), 269, 351 , 363, 

382, 385, 407, 421. 

Ehlert, Ernst Fred, 248. 

Ehlert Bohning, Pauline Anna Marie, 248. 
\eightpoint, 415 , ^16. 

\eject (force page break), 24-25, 105, 

109, 189, 353, 4I8, 419. 
elbows, see angle brackets. 

Nell ( l ), 132, 435. 
ellipses (■••), 73, 172-174, 176-177, 
180-182, 245. 

Elphinstone, Mountstuart, 89. 

*\else, 207, 210, 213. 
em, 60, 154, 166, 214, 270, 352, 414, 433. 
em-dash ( — ), 4, 302. 
em quad, see \quad. 
embellished letters, see accents, 
emergency stops, 299-300. 
*\emergencystretch, 107 . 274. 

Emerson, Ralph Waldo, 41. 
emphatics, see dot-under. 

(empty), 268 . 

\empty, 263, 351 , 378. 
empty discretionary, 95, 286. 
empty group, 19, 253, 305. 
empty line in input file, 24, 37, 47, 
3.(0-341, 381. 
at end of file, 217. 
empty line in output, 114, 316. 
empty page, 114. 

Nemptyset ( 0 ), 128, 435. 
en-dash ( - ), 4, 187, 252, 314. 
en quad, 71, see \enspace. 

Wend, 23, 26, 27, 87, 264, 283, 286, 

299, 336, .(03. 

end of an input file, 206, 214, 217. 
end of file line, see (return), \cr. 
end of file page, see (formfeed), 
end of paragraph, 286, see Npar. 
Nendchapter, 418—419. 

*\endcsname, 40-41, 213 , 283 , 348, 375. 
endgame, 87, 264, 283. 
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\endgraf, 262, 286, 331 , 351, 407, 416, 419. 
*\endgroup, 21, 249, 262, 279, 380, 407, 419. 
*\endinput, 47, 214 . 

\endinsert, 115—116, 363 . 

\endletter, 4 03-404, 407 . 

\endline, 351 . 

*\endlinechar, 48, 273, 331, 348, 390-391. 
\endtemplate, 240. 

\enskip, 71, 352 . 

\enspace, 202 , 352 . 4 10. 
enumerated cases, in formulas, 175. 
in separate paragraphs, 102, 340. 
within a paragraph, 92. 
enunciations, see \pro claim, 
epigraphs, 418-419, 425. 

\epsilon ( e ), 1, 128, 434. 

\eqalign, 190-191, 193, 242, 326, 362 . 
\eqalignno, 192-193, 194, 362 . 

(eqno) , 293 . 

*\eqno, 186-187, 189-191, 193, 293, 375-376. 
(equals), 275, 276-277. 
equals sign, 51, 133, 209, 226, 275, 376. 
equation numbers, 186-196. 

\equiv ( = ), 133, 436. 

Erdos, Pal (= Paul), 53. 

*\errhelp, 275, 280, 347. 

*\errmessage, 216, 279-280 , 3,(7, 418- 
error messages, 30-33, 295-301, 308-309. 
error recovery, 31-32, 46, 215, 295-303, 309. 
*\errorcontextlines, 34 . 273, 348. 
*\errorstopmode, 32, 33, 277, 299. 
es-zet ( fi ), see \ss. 

(escape) (ASCII code 27), 369. 
escape character, 7, 37, 421. 

*\escapechar, 40, 213, 228 . 273, 308, 

348 , 377. 

\eta ( r) ), 434. 
etc . , 302. 

Eulerian numbers, 152. 

Evagoras of Salamis, 239. 
even-numbered pages, 252-253, 416- 
*\everycr, 275, 362. 

*\everydisplay, 179, 275, 287 , 326. 
*\everyhbox, 275, 279. 

*\everyjob, 275. 

*\everymath, 179, 275, 287 , 293 , 326. 
*\everypar, 105, 215, 253, 262, 275, 282, 

283 , 333, 381, 407, 421. 

*\everyvbox, 275, 279. 
ex, 60, 154, 270, 356, 433- 
exception dictionary, 449, 452-453. 
exclamation point, 51, 72, 73, 75, 161, 169. 
\exercise, 10, 422 . 


exercises, vii, 1-303. 

*\exhyphenpenalty, 96, 262, 272, 348. 
\exists ( 3 ), 435. 

\exp ( exp ), 162, 361. 

*\expandafter, 40, 213, 215, 260, 308, 

330, 348, 374-380. 

expansion of expandable tokens, 212-216, 
238, 267, 373-374. 
avoiding, 216, 262-263, 377. 
explicit hyphens, 4, 93, 454. 
explicit kerns, 40, 280, 306, 454-455. 
exponents, see superscripts, 
extensible characters, 442. 
extension fonts, 157, 351, 432-433, 441, 447. 
extensions to T^X, 226, 228-229. 
eyestrain, reducing, 59. 

faces, 13, 17, 390. 

(factor), 270. 
factorial, 169, 181. 

*\f am, 154-159, 273, 289-290, 346-347, 

351, 358, .{14-415. 

families, 153-159, 289-290, 346, 431, 442. 
(family assignment), 277 . 

(family member), 271 , 277. 
family tree, 248. 

family 0 (math roman fonts), 153-157, 351. 
family 1 (math italic fonts), 155-157, 351. 
family 2 (math symbol fonts), 157, 351, 

431, 441, 447. 

family 3 (math extension fonts), 157, 

351, 432-433, 441, 447. 

Fatal format file error, 299. 
feasible breakpoints, 99, 315. 
fences, see openings, closings, delimiters, 
ffl, see ligatures. 

*\f i, 207, 210, 213. 

Fibonacci, Leonardo, of Pisa, 166. 
fields of atoms, 158-159, 289-291. 
fil, 72, 118-119 , 271, 348, 394 • 

(fil dimen), 271. 

(fil unit), 271 . 

\f ilbreak, 111, 353 . 

(file name), 214, 216, 226, 277, 278. 
file names, 25, 214, 216-217, 226, 278. 
file pages, 343. 

file types, see .dvi, .fmt, .tfm, .tex, 
log file, terminal, 
fill, 72, 118-119, 271, 347. 
fill page with blank space, see \vfill. 
(filler), 276, 278, 280-282, 289. 
filll, 72, 118, 271, 332. 

*\f inalhyphendemerits, 98, 106, 273, 

348, 451. 
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*\f irstmark, 213, 258, 259-260, 280. 
\fiverm, 153, 350 , 351 , 

\flat ( b ), 409 , 435. 

floating insertions, 115-116, 125, 363. 

*\f loatingpenalty, 123-125, 272, 281, 363. 
floor brackets ( [m, 146-147, see \lfloor, 
\rf loor. 

flush left, 72, 142, 177, 181, 196. 
flush right, 71-72, 106, 142, 177, 196, 

233, 419. 

.fmt, 39, 344. 

\fmtname and \fmtversion, 364. 

\folio (typeset page number), 252-253, 

362 . 406, 416. 

(font), 213, 214, 271, 277. 

*\f ont, 16-17, 60, 210, 213, 214-215, 

271, 276 . 

(font assignment), 277 . 
font metric files, 46, 433. 

(font range), 271 . 

(fontdef token), 271 . 

*\f ontdimen, 76, 157, 179, 214, 271, 277, 
355-356, 375, 390, 433, 441, 447. 
*\fontname, 213, 214. 
fonts, 13-17; 

see also Computer Modern fonts. 
\footins, 256, 363 . 396-399, ^16. 

\f ootline, 252, 256, 340-341, 362 . 
\footnote, 82, 116, 251, 256, 340, 

363 , 382, 416. 

\f ootnoterule, 256, 364 . 
footnotes, 105, 116-117, 121, 125, 

173, 416-417. 
short, 395-400. 

\forall ( V ), 435. 
forbidden control sequence, 206. 

Ford, Patrick Kildea, 293. 

foreign languages, 45-46, 52-54, 370, 

387, 449, 451. 
form letters, 200, 207-209. 
format file, 25-26, 39, 283, 344. 
format-independent documents, 194, 203. 
formats, 11, 39, 200, 403-425, 434. 
(formfeed) (ASCII code 12), 343, 369. 
formulas, 127-199. 

Foss, Sam Walter, 439. 

fractions, 67, 139-143, 152, 170, 179, 

186, 332, 444-445. 
huge, 196. 

slashed form, 67, 139-140, 233, 236. 
Fraktur, 164, 434. 

Franklin, Benjamin, 65. 

French, 54, 340-341, 455. 


\f renchspacing, 74, 340, 351 , 381, 401. 
Frescobaldi, Girolamo, 239. 
front matter, 413. 

\frown (-•''), 436. 
full stop, see period. 

\f ullhsize, 257. 

\fullline, 257. 
funny space, 377. 

*\futurelet, 207, 215, 262, 277, 363, 
375-377, 423. 

\f uturenonspacelet, 376. 

Galilei, Galileo, 101. 
galley file, 303, 425. 

Gamble, James, 38. 

\gamma ( 7 ), 127, 434. 

\Gamma ( T ), 127, 169 , 358, 434. 

\gcd ( gcd ), 162, 192, 361. 

*\gdef, 206, 215, 275, 352, 407. 

\ge ( > ), 9, 45, 175, 318, 361, 438. 

(general text), 276, 277, 279, 280, 287, 292. 
generalized fraction, 152, 157, 292 . 444—445 . 
generic coding, 194, 203. 
generic matrix, 177, 245. 

\geq ( > ), 318, 436. 

German, 52, 96, 451. 

German black letters, see Fraktur. 
\getfactor, 375 . 

\gets ( <— ), 361 , 438. 

\gg ( > ), 436. 

Gibbon, Edward, 117. 
nglobal, 21, 119, 206, 218, 232, 256, 275, 

301, 307, 320, 346. 

(global assignment), 179, 277 . 

*\globaldef s, 238, 273, 275. 
glue, 63, 69-83, 95, 110, 157, 222-225, 

302, 412. 

above and below displays, 189-190, 194. 
at top of page, 113-114, 124, 256. 
between aligned columns, 237-239, 

247, 392. 

between lines, see interline glue, 
between paragraphs, 79, 104-105, 262, 
282, 342, 406, 417. 

between words, 74-76, 356, 393-394, 433. 
(glue), 71, 118, 271 . 

\glue, 75, 79, 302. 

(glue parameter), 271, 274 , 276. 
glue set, 79, 302. 
glue set order, 77, 79, 81, 97. 
glue set ratio, 77, 79, 81, 97. 

(glue variable), 276 . 

goal height, 112-114, 123-125. 

Goethe, Johann Wolfgang von, 183. 
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\goodbreak, 111, 116, 353 . 

Grandmaster, 253. 

\grave (math accent: x ), 135. 

grave accent ('), see V, \grave. 

greater than or equal, see \ge. 

greater than sign, 52, 53, 133, 150, 209. 

Greek, 127-128, 137, 156, 319, 358, 430, 434. 

Green, Walter, 244. 

grouping, 13-14, 19-21, 119-120, 122, 

200, 201, 241, 259, 279, 283, 286, 291, 
301, 375, 378, 385, 453. 
characters for, 39-40, 381-382. 
implicit, 115, 148, 194, 253, 287, 292, 293 . 
guide words, 259. 

\H (long Hungarian umlaut), 52, 53, 

356 , 420. 

H&J, see hyphenation, line breaking, 
setting glue. 

hacek accent ("), see \v, \check. 
halftones, 228. 

*\halign, 117, 190, 193, 194, 235-249, 

282 , 286, 291, 302, 326, 352, 

361-362, 386 , 392. 
compared to tabbing, 235. 

Halmos, Paul Richard, 183. 

Hamza, see apostrophe. 

Viang, 355 , ) 19. 

*\hangafter, 14)2, 103-104, 273, 

348-349, 419. 

*\hangindent, 102, 262, 274, 349, 407. 
hanging indentation, 79, 102-103. 
hanging punctuation, 394-395. 

Hardy, Godfrey Harold, 321. 
harpoons, 437. 

hash mark, 38, 51, 113, 200-202, 203 . 

204-205, 228, 235, 236-240. 

Hassler, Hans Leo, 239. 

hat, 38, 51, 128-130, 134, 369, .{23. 

\hat (math accent: x), 135-136, 164. 
hat accent (’), see \~ , \hat, \widehat. 
*\hbadness, 29, 272, 302, 348, 387-388, 401. 
\hbar ( h ), 169, 358, 435. 
hbox (box with horizontal list inside), 64. 
*\hbox, 64-67, 77, 86, 93, 151, 159, 163, 

175, 179, 185-186, 221, 222, 278, 

282, 388-389. 

Vieadline, 252-253, 255, 362, 406. 
\heartsuit ( G ), 435. 

Hebrew, 66. 

height, 221, 224, 282, 337. 

height of a box, 63-67, 77, 80-82, 225. 

help messages, 32, 280, 295-296. 

(hex digit), 269 . 


(hexadecimal constant), 269 . 
hexadecimal notation, 43, 44 , 45, 47-48, 
154, 219, 336, 420. 

*\hf il, 71-72, 194, 235-237, 283, 285, 

290, 397. 

*\hf ill, 71-72, 142, 177, 194, 233, 

283, 285, 290. 

*\hf ilneg, 72, 100, 233, 283, 285, 290, 397. 
*\hfuzz, 30, 274, 302, 348, 387-388. 

\hglue, 352 . 

\hideskip, 347 . 348, 354. 

\hideuidth, 243, 245, 247, 325, 354 . 
higher-level languages for composition, 203. 
Highton, Albert H., 481. 

Hilbert, David, 167. 

*\hof f set, 251, 274 , 342. 

*\holdinginserts, 125 , 273, 400. 

Holmes, Thomas Sherlock Scott, 401. 

\hom ( horn ), 162, 361. 

Iiombert, Humbert, 401. 

Honeywell Information Systems, 1. 

\hooklef tarrou ( ), 358, 437. 

\hookrightarrow ( '— t ), 358, 430, 437. 
hooks, see \subset, \supset. 
horizontal braces, 176, 225-226, 339. 
(horizontal command), 283 . 
horizontal lists, 64, 94-95. 
horizontal mode, 85-89, 105, 285-287 . 
(horizontal mode material), 278. 

(horizontal rule), 281. 

horizontal rules, 24, 64, 221-226, 246, 282. 

(horizontal skip), 285. 

Hornschuch, Hieronymus, 483. 

Howard, Jane Temple, 21. 

\hphantom, 178, 211, 360 . 

*\hrule, 24, 64, 85, 221-225, 246, 281-282, 
286, 357, 420, 421. 

\hrulefill ( ), 244 - 252, 357 . ^12. 

*\hsize, 26-27, 60, 102, 188, 237, 251, 

257, 274, 340-341, 348, 387, 4 06, 

407, 413, 415, 41 7. 

*\hskip, 71, 86, 168, 283, 285, 290, 314 . 
*\hss, 71, 82-83, 233, 283, 285, 290, 442. 
*\ht, 120, 271, 388-389. 417. 

Hungarian umlaut ("), see \H. 
hyphen, 4, 51, 93, 95, 132, 292. 
hyphen.tex, 364, 453 . 
hyphenation, 28, 39, 96, 306, 314, 394-395, 
414, 424, 449-455. 

*\hyphenation, 277, 419, 452-453. 455. 
(hyphenation assignment), 277 . 
*\hyphenchar, 95, 214, 271, 273, 277, 286, 
351, 395, 414 , 454, 455. 
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♦\hyphenpenalty, 96, 101, 272, 348, ^51. 

\i ( i ), 52-53, 356 . 

I can’t go on, 299. 

Malign, 354 . 

IBM Corporation, vii. 
identifiers in programs, 38, 165, 234- 
*\if , 209, 210-211, 307, 377, 379. 

*\if case, 210, 349, 373, 390, 406. 

♦Ufcat, 209, 210, 307, 377. 

*\ifdim, 209, 353, 387, 417. 

*\ifeof , 210, 217. 

\iff ( ^=7 ), 163, 361, 438. 

♦Uffalse, 210, 211, 260-261, 348, 385-386. 
*\ifhbox, 210, 392, 399. 

♦Ufhmode, 209 , 363. 

♦Ufinner, 209 . 

♦Ufmmode, 209 , 215, 240, 353 , 356, 360, 423. 
*\ifnum, 208, 209, 218-219. 

*\if odd, 207, 209, 416, 418-419. 

♦Uftrue, 210 , 211, 260-261, 348. 

\if undefined, 40. 

*\ifvbox, 210 . 

♦Ufvmode, 209 . 

*\ifvoid, 210 . 256. 

*\ifx, 210 , 215 , 307, 384, 375-377, 4 1 8. 
ignored characters, 37, 390. 

♦Ugnorespaces, 279 , 333, 355, 4%4- 
! Illegal unit, 295. 
illustrations, 115-116. 

fitting copy around, 101. 

\Im ( 5} ), 435. 

\imath ( 1 ), 136, 435. 

♦Ummediate, 226-228, 280, .(22, 423. 
implicit braces, 269 , see \bgroup, \egroup. 
implicit characters, 269, 277, 309, 

351, 376-377. 

implicit kerns, 306, 454 , 455. 
in (inch), 24, 57, 270. 

\in ( £ ), 128, 147, 436. 

Mnaccessible, 297. 
inch, 57-58. 

incomplete conditionals, 206. 

♦Undent, 86, 94, 101, 263, 282, 286, 291, 355. 
indentation, 86, 222, 282. 

hanging, 79, 102-103. 
indention, see indentation, 
indexes, 261-263, 392-394, 423-425, 481. 
indices, see subscripts. 

\inf ( inf ), 162, 361. 
inferiors, see subscripts, 
infinite badness, 97, 107, 111, 229, 317. 
infinite glue, 71-72, 118-119, 256, 313, 332. 
infinite loop, 299, 301, 383. 


infinite penalty, 97, 111, 254-256, 

264, 286, 400. 

\infty ( 00 ), 9, 318, 435. 

inhibiting expansion, 216, 262-263, 377. 

IN HEX, 39, 41, 76, 157, 283, 336, 

343-345, 453. 

Inner atom, 158, 170, 289, 443, 445-446. 
♦Unput, 7, 9, 25-27, 47, 199, 214, 217, 

380, 382-383, 403, 422. 
input/output commands, see Unput, \read, 
\urite, \message, \dump. 
input/output streams, 346, see \openin, 
\openout. 

input stack, 300, 374. 

♦\inputlineno, 214, 271. 

♦Unsert, 95, 122-125, 259, 280-281, 

363, 416, 424, 454. 
inserting text online, 31. 
insertions, 110, 115-117, 122-125, 256, 335. 
♦\insertpenalties, 111, 114, 123—125 , 

214, 254, 256, 271. 

Unt (large /), 144, 168-169, 192, 358, 435. 
(integer constant), 269-270 . 

(integer parameter), 271, 272-273 , 276. 
(integer variable), 276 . 
integral signs, see \int, \smallint. 
multiple, 169, 180. 

inter-column spacing, 237-239, 247, 392. 
interacting with T^vX, 31-34, 217-218, 

228, 295-299. 

(interaction mode assignment), 277 . 
Unterdisplaylinepenalty, 193, 349, 362. 
\interf ootnotelinepenalty, 349, 363. 
interline glue, 78-79, 80, 104, 105, 221, 245, 
263, 281-282, 335, 352, 409. 
♦Unterlinepenalty, 104 , 272, 363, 406, 419. 
internal box-and-glue representation, 66, 

75, 79, 158-159, 302, 455. 
internal character codes, 43-46, 367-370. 
(internal dimen), 271 . 

(internal glue), 271. 

(internal integer), 271. 

(internal muglue), 271. 

(internal quantity) , 279 . 

Internal Revenue Service, 200, 208-209, 

244, 404. 

(internal unit), 270 . 

internal vertical mode, 85, 87-89, 

222, 278-283. 
interrupts, 33, 299, 383. 
interword spacing, 74-76, 356, 393-394 , 433. 
Interwoven alignment preambles, 299. 
(intimate assignment), 277 . 
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invalid characters, 37, 45. 

\iota ( l ), 325 , 434. 

Isocrates, 239. 

\it (use italic type), 13-14, 165, 231-232, 
332 , 351, 409 , 414-415 , 428. 

italic corrections, S.j, 287 , 306, 441, 455. 
italic letters with descenders, 319. 
italic type, 13-14, 100, 127, 409, 428, 430. 
\item, 102-103 , 117, 340-342, 355 , 4 1 6 , 419. 
\itemitem, 102, 342, 355 . 
iteration, see \loop. 

Mtfarn, 351 . 

Iur’ev, Sergei Petrovich, 53. 

\j ( J ), 52, 356. 

Jevons, William Stanley, 5. 

\jmath ( j ), 136, 435. 

*\jobname, 213 , 214, 336. 

Johnson, Samuel, 89. 

\joinrel, 358 . 
jokes, vi, 303, 449. 

\jot, 194, 242, 349, 362. 

Joyce, James Augustine, 100. 
justification, see setting glue, line breaking. 
Justus, Paul E., 455. 

\kappa ( k ), 128, 434. 

\ker ( ker ), 162, 361. 

*\kern, 10, 40, 66, 75, 87, 168, 256, 263, 280, 
306, 389, 394 -395, 4 1 6, 42.{, 454-455. 
Kernighan, Brian Wilson, 159. 
kerns, 4, 66, 75, 95-97, 110, 157, 168, 280, 
286, 306, 444, 454-455. 

Key Index, 392-394. 
keyboards, 3, 5, 43-46, 368-370. 
keywords, 61, 71, 268 , 337, 370. 

Knuth, Donald Ervin, i, ii, vii, 74, 92, 211, 
259, 337, 340-341, 412-413, 424-425. 
Knuth, Nancy Jill Carter, iii, 125, 418. 
Kochel, Ludwig, Ritter von, 409. 

1 after fil, 271 , 337. 

£, see pound sterling. 

\1 ( 1 ), 52-53, 356- 
\L ( L ), 52-53, 356. 

\lambda ( A ), 176, 434. 

\Lambda ( A ), 323, 434. 

Lamport, Leslie B., 137. 

\land ( A ), 133, 361, 438. 

Mangle ( ( ), I 46 -I 47 , 150, 156, 359, 437. 
*\language (hyphenation method), 

273, 346, 455. 

large delimiters, 145-150, 442. 

large operators, 144~145, 154-155, 358, 

435 ; see also Op atom. 


*\lastbox, 222, 278, 354, 392 , 398, 399. 
Mlastkern, 214, 271. 

Mlastpenalty, 214, 271. 

Wlastskip, 214, 223, 271, 392. 

IATeX, 137. 

Ubrace ( { ), 146-147, 359, 437. 

Ubrack ( [ ), 146-147, 351, 369, 437. 
*\lccode, 41, 214, 271, 345, 452-454. 

\lceil ( f ), 146-147, 359, 437. 

\ldotp, 358, 359 , 438. 

\ldots (...), 73, 172-174, 177, 180-181, 
199-201, 359, 438. 

\le ( < ), 9, 45, 133, 162, 318, 361, 438. 
leaders, 222, 223-226, 228, 280-282, 285, 

290, 357, 392-39.{. 

(leaders), 281 . 

nieaders, 95, 110, 223, 224, 225, 357, 
392-394. 

leading, see \baselineskip, \vskip. 
Meavevmode, 313, 333, 356 , 408, 4%0. 

Lee, Marshall, 17. 

*\left, 148-150, 155-157, 171, 196, 292, 437. 
left brace, 13-14, 19-21, 38, 51, 200-202, 
203-204, 205-206, 216, 269, 275-276, 
283 , 286 , 291 , 330. 

(left brace) , 275 . 

left bracket, 51, 134, 146 -I 48 , 171, 4 O 8 , 437. 
left delimiters, see openings, 
left-hand pages, 252-253, 436. 
left parenthesis, 51, 134, 140 , 145-150, 345. 
left quote, 3-5, 132, 134, 305, 394-395; 

see also alphabetic constant. 

Meftarrow ( 1— ), 226, 437. 

\Leftarrow ( ), 226, 437. 

\lef tarrouf ill (t ), 357 . 

\lef tharpoondoun ( ^ ), 437. 

\lef tharpoonup ( *— ), 437. 

*Mef thyphenmin, 273, 364, 454 , 455. 
Meftline, 101, 257, 259-260, 326, 353 . 

Mef trightarrow ( -fA ), 437. 

\Lef trightarrow (<=>), 437. 

Wleftskip, 100, 274, 317, 407, 419. 

Legendre symbol, 152. 

Leontief, Wassily Wassily, 265. 

Meq ( < ), 318, 358, 436. 

Meqalignno, 192, 194, 362 . 

*\leqno, 187, 189, 293, 375-376. 

Lesk, Michael Edward, 247. 
less than or equal, see \le. 
less than sign, 52, 53, 133, 150, 209. 

*\let, 206-207, 215, 277, 307, 309, 352, 376. 
(let assignment), 277 . 
letterformat.tex, 403, 406—408 . 
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letterheads, 407. 

letters, 7-8, 37, 41, 45-46, 51, 132, 

157, 344, 370. 

Ufloor ( [ ), 146-147, 359, 437. 

Mg ( lg ), 162, 361. 

Mgroup, 150, 176, 359, 437. 

\lhook, 358 . 

Liang, Franklin Mark, 449. 
library of macros, 199, 382-384, 425. 
lies, vii, 303. 

ligatures, 4, 19, 46, 51, 54, 75, 95, 165, 

286, 302, 381, 427, 444, 454. 

\lim ( lim ), 162, 163, 361. 

Uiminf ( liminf ), 162, 163, 178, 361. 
♦Mimits, 144, 159, 292, 359, 443. 
limits above and below operators, 

144-145, 149, 179. 

\limsup ( lim sup ), 162, 163 , 178, 361 . 
Lincoln, Abraham, 11. 

Mine, 72, 77, 101, 224, 232 , 252, 

255-257, 353, 412. 

line breaking, 97-100, 173-174, 392-395, 
398-400. 

line breaks, avoiding bad, 27-30, 91-94, 
173-174, 197. 
forcing good, 94, 114. 
line rules, see \hrule, \vrule. 

(linefeed) (ASCII code 10), 369. 
♦Minepenalty, 98, 272, 314, 316, 348. 
lines of input, 24, 46-48 , 340. 

♦Mineskip, 78-80, 104, 194, 274, 281, 

349, 351-352. 

♦Mineskiplimit, 78-80, 104, 194, 274, 

281, 349, 351-352, 362. 
list macros, 378-380. 

Misting, 380, 391. 

\11 ( < ), 431, 436. 

Llanfair P. G., 451. 

Mlap, 82-83, 189, 340-341, 353 , 355, 

381, 416-417, 422. 

Mmoustache, 150, 359 , 437. 

\ln ( In ), 162, 186, 361. 

Mnot ( ), 361, 438. 

local, 19-21, see grouping. 

Locke, John, 303. 

Mog ( log ), 162, 169-170, 361. 
log file, 10, 28, 66, 226, 303, 347. 
logical operators, see Mand, \lor, \lnot. 
logo, 1, 8, 412, see \TeX. 

♦Mong, 205-206, 210, 275, 331, 375, 378, 382. 
long formulas, 195-197. 

Mongindentation, 407. 

Monglef tarrow ( t — ), 358 , 437. 


Monglef tarrow ( < 1 = ), 358 , 437. 

Monglef trightarrow ( t 1 ), 358 , 437. 
Monglef trightarrow ( <=> ), 358 . 437. 
Mongmapsto ( 1 — > ), 358 . 437. 
Mongrightarrow ( — ), 325, 358 . 437. 
Mongrightarrow ( => ), 358 . 437. 
looking ahead, 207, 376-377. 

Moop, 217-219 , 352, 373-374, 387, 417. 
loose lines, 97, 99, 302. 

♦Mooseness, 103-104, 109, 273, 342, 349. 
Mor ( V ), 133, 361, 438. 
low-resolution printer, 59. 

*\lower, 66, 80, 151, 179, 285, 290- 
♦Mowercase, 41, 215, 279, 307, 345. 
lowercase letters, 9, 268, 370, 453. 

Mq ( 1 ), 5, 48, 351, 369, 395. 

Luckombe, Philip, 447. 
lxix, 420. 

\m@ne, 347 . 

\m@th, 353 . 

machine-independence, 58; see also .dvi. 
Macro, Naevius Sertorius, 219. 
macro arguments and parameters, 

33, 200-205, 249, 263, 362, 363, 
375-380, 407. 

(macro assignment), 275 . 
macro conventions, 121, 346, 364. 
macron accent (~), see \=, \bar. 
macros, 199-219, 373-401. 

to save typing in math, 136, 199-200, 324- 
♦Mag, 60, 270, 273, 348. 

Magnification, 17, 59-60, 364 , 403-404 ■ 
magnified output, 16-17, 59-60, 403, 433. 
Magstep, 17, 59-60, 349 . 403-404- 
Magstephalf , 17, 349 . 403. 
mail, 403-408. 

main vertical list, 85, 110, 112 . 125, 

253-254, 281. 

\maintoks, 262. 

\makef ootline, 255—257, 364. 
Makeheadline, 255, 257, 364 . 

Makelabel, 403, 404^405, 408 . 
manfnt, 44, 408, 414. 
manmac.tex, 413—425. 

Manual, 44, 414 , 410-420. 

Mapsto ( 1 — t ) , 128, 358 , 437. 

\mapstochar ( 1)1 358. 

Margin, 415, 424. 

marginal hacks, 82, 105, 400, 415, 424. 
margins, see \hsize, \narrower. 

♦Mark, 95, 157, 216, 258-263, 280, 417, 454. 
marks, 95, 110, 157, 213, 258-263. 
markup commands, see control sequences. 
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math accents, 135-13 7, 141, 164-165, 

359 . 435, 443 . 

(math atom), 291 . 

(math character) , 289 . 
math character codes, 154-157. 

(math field), 289 . 

math fonts, 157, 351 , 430-433, 441, 447. 
math formulas, how to type, 127-197. 
math italic, 164-165, 409, 430, 433. 
math lists, 157-159, 441-446. 
math mode, 85-89, 127, 157, 289-293. 
math spacing table, 170-171. 

(math symbol), 289 . 
math symbols, 127-128. 

construction of, 151, 178, 358-361. 
table of, 434-438. 
math-off, 95-97, 287, 447. 
math-on, 95, 97, 287, 447. 

*\mathaccent, 157, 170, 291 , 359, 443 . 
*\mathbin, 155, 291, 361. 

*\mathchar, 155, 289. 

*\mathchardef , 155, 199, 214, 215, 272, 

277, 289, 336, 394. 

(mathchardef token), 271. 

*\mathchoice, 151, 157, 292. 

*\mathclose, 155, 291, 322, 359. 

*\mathcode, 134, 154-155. 214, 271, 289, 

319, 326, 844- 

mathematical expressions, 127-197. 
Mathematical Reviews, 106, 392-394. 
(mathematical skip), 290. 
mathematical style, 166-167, 182-183. 
\mathhexbox, 356 . 

*\mathinner, 155, 171, 199, 291, 359. 
*\mathop, 155, 178, 291, 324-325, 361. 
*\mathopen, 155, 291, 322, 359. 

*\mathord, 88-89, 155, 291. 

\mathpalette, 151, 360 . 

*\mathpunct, 155, 291. 

*\mathrel, 155, 291, 359-361. 

\mathstrut, 131, 178, 360 . 

*\mathsurround, 97, 162, 274, 305, 314, 

323, 353, 447. 

matrices, 176-178, 181; see also alignments. 
\matrix, 176-178, 182, 325, 361. 

\max ( max ), 162-163, 170-171, 361. 
*\maxdeadcycles, 255 , 273, 348. 

*\maxdepth, 112-114, 123-125, 255, 262-263, 
274 , 348, 400, 415. 

\maxdimen, 58, 188, 262-263, 347 , 348. 
maximum legal dimension, 58. 
maximum legal integer, 118. 

*\meaning, 213-215 , 336, 382. 


measure, see \hsize. 

\medbreak, 111, 113, 353 , 355, 419, 4 22. 
*\medmuskip (medium math space), 

167-168, 274, 349, 446. 

\medskip (medium extra vertical space), 70, 
79, 102, 109, 111, 352, 410-412. 
\medskipamount, 349 , 352-353, 355, 407. 
\medtype, 408—411. 
membership, see \in, \ni, \notin. 
memory space, 100, 300-301, 342, 345, 

347, 383, 384, 394, 400. 

*\message, 216, 217-218, 227-228, 279, 308, 
328, 343-344, 355, 418. 

META FONT, 420, 427, 442, 483. 
metric units, 57-60. 

\mid ( | ), 174, 436. 

\midinsert, 116, 340-34 1 363 . 
migration, 105, 117, 259, 280-281, 

282, 286, 287. 

\min ( min ), 162-163, 170-171, 361. 

minus, 71, 271, 337. 

minus sign, 4, 51, 127, 132, 226, 268. 

! Missing something, 296-297, 337. 

\mit (math italic family), 164, 351 . 430, 4^4- 
*\mkern, 168, 280 . 442 . 
mm (millimeter), 57, 270. 
mod, 164, 322, 361 . 

\models ( j= ) , 358 , 436. 
modes, 46, 85-89, 175, 267-293. 
modes.tex, 88—89. 
money, 54, 140, 208-209. 

*\month, 273, 349, 406. 

Morris, William, 107. 

moustaches, 150. 

mouth, see anatomy of T)?X. 

*\moveleft, 80-81, 282, 287. 

*\moveright, 80-81, 221, 282 . 

Moxon, Joseph, 287. 

Mozart, Johann Chrysostom Wolfgang Gott- 
lieb (= Theophilus = Amadeus), 409. 
\mp ( =p ) , 133, 436. 

*\mskip, 168, 290, 442. 
mu (math unit), 168, 270, 442. 

\mu ( p ), 162, 434. 

(mu unit), 270 . 

(mudimen), 270 . 

(muglue), 118, 167-168, 271 . 

(muglue parameter), 271, 274 , 276. 

(muglue variable), 276 . 

Muir, Cockburn, 283. 

Muirhead, James, 21. 

multicolumn format, 257, 396-397, 417. 

multiple integrals, 169, 180. 
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*\multiply, 118-119 , 218, 276, 349, 391, 398. 
\multispan, 243, 246-247, 334, 354 . 

Munster, 55. 

(mushrink), 271 . 
music, 408-412. 

*\muskip registers, 118, 168, 271, 276 . 
*\muskipdef , 119, 215, 277 . 

(muskipdef token), 271 . 

(mustretch) , 271. 

\nabla ( V ), 435. 
names, 73, 92. 

\narrower, 100 , 340-341, 355 . 

National Science Foundation, vii. 

\natural ( ^ ), 435. 
natural width, 69. 

Naur, Peter, 268. 

Yne ( yt ), 9, 45, 133, 318, 361, 438. 

\nearrow ( y ), 437. 

\neg ( -i ), 435. 

negated relations, 436-437. 

negative dimensions, 66, 222. 

\negthinspace, 332, 352 . 

\neq ( ), 318, 358, 438. 

nesting (i.e., groups inside groups), 

20-21, 210, 385. 

new symbols, 151, 178, 358-361. 

\neubox, 121, 346, 341, 353, 394, 417. 
\newcount, 121, 218, 346, 347 , 349, 418. 
\newdimen, 121, 346, 317 . 349, 415. 

\neuf am, 121, 157, 346, Ml, 351. 

\newhelp, 346, 347 . 

\newif , 211, 218, 348, 35.(, 375, .{16, 423. 
Ynewinsert, 121, 122, 346, Ml, 363, 415. 
\neulanguage, 346, 347 . 

*\newlinechar, 228 . 273, 348. 

\neumuskip, 121, 346, 347 . 

\newread, 121, 216, 346, 347 . 

\neuskip, 121, 346, 347 . 349, 394, 414- 
Ynewtoks, 121, 212, 262, 346, 341, 401. 
\neuwrite, 121, 227, 346, Ml, 422-423. 
\next, 352 . 

\ni ( 9 ), 436. 

\ninepoint, 15, 414 , 419. 

\ninerm, 15, 413. 

\ninesl, 15, 413. 

No room, 347. 

*\noalign, 176, 191, 193, 237, 246, 249, 

282 . 285 . 286. 

*\noboundary, 283, 286 , 290. 

\nobreak (inhibit line or page break), 97, 
109, 17.{, 193, 335 , 353 , 394, 4 07. 
*\noexpand, 209 , 213 , 215 , 216, 348, 377, 424 . 


*\noindent, 86, 188, 262-263, 283 , 286, 

291, 340-341, 355, 4 1 9. 
\nointerlineskip, 79-80, 255, 331, 

352 , 389. 

*\nolimits, 144, 159, 292, 358, 361. 
(non-macro assignment), 275 . 
nonaligned leaders, 224-226. 
\nonfrenchspacing, 74, 351 . 

*\nonscript, 179, 290 . 442, 446. 
*\nonstopmode, M, 277 . 299. 
\nopagenumbers, 251-252, 362 . 406, .{09. 
norm symbol, see absolute value, 
vertical line. 

(normal dimen), 270 . 

(normal integer), 269 . 

(normal mudimen), 270 . 

\normalbaselines, 325, 349, 351 . 414^415- 
\normalbaselineskip, 349 , 414^415- 
\normalbottom, 363 . 

\normallineskip, 349 , 351. 
\normallineskiplimit, 349 , 351, 362. 
Norwegian characters, 45-46, 52-53, 370. 
\not ( /), 133, 358, 436- .{37. 

\notin ( (f: ), 360 , 437. 
nth, 323. 

\nu ( v ), 128, 163, 434. 
nucleus, 158-159, 289-292, 441-446. 

\null, 311, 332, 351 . 

(null) (ASCII code 0), 37, 39, 48, 343, 369. 

null control sequence, 46, 308. 

null delimiter, 149-150, 152, 156, 345, 

360, 362. 

null set, see \emptyset. 
*\imlldelimiterspace, 150, 274, 3.{ 8, 442 . 
*\imllf ont, 14, 153, 271, 433. 

(number), 44, 118, 269-270, 272, 309, 349. 
*\number, 40-41, 213, 214, 252, 406, 424 . 
number sign, see hash mark, 
numbered footnotes, 121, 125. 
numerals, see digits, roman numerals, 
numerator, 141, 152, 179, 444-445. 
(numeric variable), 276 . 
numerical tables, 240-241. 

\nuarrow ( \ ), 437. 

0 versus 0, 132. 

\o ( 0 ), 356 . 

\0 ( 0 ), 356- 

\oalign, 356 . 
obelisk or obelus, 53. 

\obeylines, 94, 249, 262, 342, 352 , 
380-382, .{07, 419. 

\obeyspaces, 254, 308, 342, 352 , 380-381, 
39.{, 421. 
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oblique, see slanted. 

(octal constant), 269 . 

(octal digit), 269 . 

octal notation, 43, .j .{. 155, 420. 

odd-numbered pages, 252-253, 1,16. 

\odot ( 0 ), 436. 

Noe ( oe ), 52, 356. 

\0F. I (K ), 52, 53, 356 . 

Office of Naval Research, vii. 

\off interlineskip, 245-247, 312 , 352 . 4 1 6. 
\oint (large f), 358 , 435. 

Noldstyle, 351 . 
oldstyle numerals, 430, 435. 

\omega ( lo ), 323 , 358 , 434. 

NOmega ( Q ), 358, 434. 

Nomicron, 434. 

Nominus ( © ), 436. 

*Nomit, 240, 243-244, 246-247 , 282. 
one half, 141, 186. 

(one optional space) , 269-270 . 

online interaction, see interacting with TJ^X. 

Nooalign, 356 . 

Op atom, 158-159, 170-171, 289, 442-444. 
Open atom, 158, 170-171, 289, 442-444, 446. 
open intervals, 171. 
openface, see blackboard bold. 

*Nopenin, 216-217, 280. 
openings, 134, 147, 154-155, 359, 437 ; 
see also Open atom. 

*Nopenout, 226-228, 254, 280, .{22, 423. 
Nopenup, 194, 237, 242, 362 . 

Noplus ( © ), 9, 154, 436. 
optimization of macros, 342, 345, 348. 
(optional assignments), 286 . 

(optional by), 276 . 

(optional sign), 57. 

(optional signs), 269 . 

(optional spaces), 268 , 269-271. 

(optional true), 270 . 

*Nor, 210, 213, 406. 

Ord atom, 158, 170-171, 289, 358, 442-446. 
ordinary symbols, 132, see Ord atom. 

Ore, Oystein, 53. 
organists, 239. 

organs, 38-39, 46, 85, 267 , 373, 456 . 
orphan, see widow word. 

Osbourne, Lloyd, 67. 

Noslash ( 0 ), 436. 

Nother, 421. 
other character, 37. 

Notimes ( ® ), 9, 436. 

*Nouter, 206, 210, 275, 354, 357, 

418-419, 422. 


*Noutput, 125, 253, 254-257, 275, 364, 

370, 417. 

output routines, 21, 112, 251-264, 417. 
when invoked, 122, 125, 281. 
*Noutputpenalty, 125 , 254-255, 273, 

349, 400, 417. 

*Nover, 139-141, 148, 152, 292, 437, 444-445. 
Over atom, 158, 289, 443. 

Noverbrace, 176, 225, 359 . 
overfull boxes, 27-30, 94, 229, 238, 

302-303, 307, 400. 
avoiding, 107. 

*Noverf ullrule, 274, 307, 348. 
overlaps, 82-83, 386. 

Noverlef tarrow, 359 . 

*Noverline, 130-131, 136, 141, 170, 291, 443. 
overloading, 54, 243. 

Noverrightarrow, 226, 359 . 
*Noverwithdelims, 152, 292, 444—445. 

Nowns ( 3 ), 361 , 438. 

NP ( 1 ), 53, 117, 356, 438-439. 

Np@, 347, 348. 

NpQrenud, 361 . 

page breaks, avoiding bad, 109-111, 

189, 193. 

forcing good, 109-111; see also Neject. 
page builder, 110-114, 122-125, 281. 

when exercised, 122, 280-283, 286-287. 
page format, modifying, 251-253. 
page make-up, 109, see output routines, 
page numbers, 21, 23, 119, 207, 251-253. 
Npagebody, 255-257, 364 . 

Npagecontents, 256, 364 . 

*Npagedepth, 114, 123, 214, 271. 

*Npagef illlstretch, 114 , 214, 271. 

*Npagef illstretch, 114 , 214, 271. 

*Npagef ilstretch, 114 , 214, 271. 

*Npagegoal, 114, 123, 214, 271. 

Npageinsert, 115, 363 . 

Npageno, 252, 256, 340, 362 . 406. 
npageshrink, 114, 123, 214, 271. 
*Npagestretch, 114 . 214, 271. 

*Npagetotal, 114, 123, 214, 271. 

*Npar, 47, 86-87, 100, 135, 202, 249, 262, 

283 , 286 , 340, 351, 380-381. 
forbidden in arguments, 205, 207. 
paragraph, implied beginning of, 85-86, 283 . 
implied end of, 24, 86, 286 . 
last line of, 99-100. 
shape parameters reset, 103. 
paragraph sign, 53. 

Nparallel ( [| ), 436. 

(parameter text), 203, 275. 
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parameters, see macro arguments and 
parameters. 

parameters, numeric, 119, 342; 
see also \f ontdimen. 
default values, 348-349. 
table, 272-274. 

parentheses, 51, 129, 134, 140, 145-150 , 
345, 437. 

roman, in italic text, 409-411. 

*\par fill skip, 100, 188, 274, 286, 307, 
315 , 332, 348 , 394 , 419. 
*\parindent, 86, 100, 101-102, 105, 262 , 
274, 282, 286, 291, 342 , 348, 355 , 
394 , 4 06, 4 1 5. 

*\parshape, 101-102, 214, 271, 277, 283, 
315, 349, 374. 

*\parskip, 79, 104-105, 262, 274, 282, 
342, 348, 355, 406, 41 7. 

\partial ( d ), 147, 435. 

Pascal, Blaise, 101-102. 

\path, 390-391. 

*\patterns, 277, 453 . 455. 

patterns for hyphenation, 449-453. 
*\pausing, 273, 303 . 

Paxton, William Hamilton, 425. 
pc (pica), 57, 270, 415. 
penalties, 95-100, 110-114, 189. 
infinite, 97, 111, 286, 400. 
negatively infinite, 97, 111-112, 114, 
254-256, 264, 400. 

*\penalty, 79, 97, 110-111, 174, 280, 353. 
\penalty-’ 10000000000, 264. 
percent sign, 26, 38, 39, 43, 48, 51, 113, 
124, 249, 337, 340. 
period, 51, 72-73, 133-134, 149, 161, 

345 , 394 -395, 430. 
space after, 73-75 ; 76. 
periodic preambles, 241-242, 244 , 

246, 361, 412. 

Perlis, Alan J., 365. 

\perp ( A ), 358 , 436. 

\phantom, 131, 178, 211, 360 , ^12. 

\phi ( <j> ), 128, 148, 434. 

\Phi ( $ ), 323, 434. 

Philips, Peter, 239. 
philosophers, 100, 239. 

(physical unit), 270 . 

\pi ( 7T ), 9, 87, 137, 148-149, 434. 

\Pi ( II ), 9, 324, 434, 435. 
piano, 17. 

pica, 57, 413, see pc. 

pictures, 228-229. 

pieces of symbols, 145, 432, 442. 


pilcrow, see \P. 

plain.tex, 342—364. 

plain TeX format, 10-11, 343-364 . 

summary, 339-342. 

\plainoutput, 255, 364 . 

Plass, Michael Frederick, 340-341. 

Plato, 1, 239. 
plus, 71, 271, 337. 

(plus or minus), 268 . 
plus sign, 51, 132, 268. 

\pm ( ± ), 133, 358, 436. 

\pmatrix, 176, 323, 362 . 

\pmod (notation like (mod p)), 164, 322, 361 . 
\pnt, 395. 
pocket-size, 59. 
poem, 94. 

points (printers’ units), 15, 57-58. 
points with arbitrary coordinates, 389. 

Polish characters, 52-53. 

Polya, Gyorgy (= George), 7. 
poor man’s bold, 386. 

*\postdisplaypenalty, 189-190, 272. 
poultry, 236-237. 
pound sign, see hash mark, 
pound sterling, 54, 428. 
powers of ten, 293. 
powers of two, 293. 

\Pr ( Pr ), 162, 323, 361. 
preambles, 206, 235-249. 

\prec ( *4 ), 436. 

\preceq ( A ), 436. 

*\predisplaypenalty, 189-190, 272 , 348. 
*\predisplaysize, 188, 190, 274, 349. 

(prefix), 275 . 

\preloaded, 350 . 413. 
preloaded formats, 25-26, 39, 283, 344. 
Presume, Livingstone Irving, 74. 
*\pretolerance, 96, 107, 272, 317, 348, 

364 , 394 , 451. 

pretty-printed programs, 165, 234. 
*\prevdepth, 79-80, 89, 271, 281, 282 . 

prevdepth ignored, 88—89. 

*\prevgraf , 103, 188, 190, 214, 271. 

\prime ( / ), 130, 155, 357, 435. 
prime numbers, 148-149, 218. 
primitive, 9-11, 267, 342, 457. 
private control sequences, 344, 364, 414. 
\proclaim, 202-203, 206, 340-341, 355 . 
Procter, William Alexander, 38. 

\prod (large II), 180-181, 435. 
programming with IJ^X, 217-219, 387-388. 
programs, for computers, 38, 165, 234. 
for music, 408-412. 
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proofreading, 59, 303. 
proper names, 73, 92. 

\propto ( tx ), 436. 
prototype row, 238, 302-303. 

Praetorius [Schultheiss], Michael, 239. 

\ps, 403, 404 , 407 . 
pseudo parameters, 119, 349 . 

\psi ( ip ), 325 , 434. 

\Psi ( 'I' ), 434. 

psychologically bad breaks, 91-93. 
pt (printer’s point), 24, 57-58, 268-270. 
Punct atom, 158, 170-171, 289, 

442-443, 446. 

punctuation, 14, 51, 72-76, 321, 394-395. 
in formulas, 134, 154-155, 161, 358-359, 
438; see also Punct atom. 

\qquad, 166, 185, 352 . 
quad, 60, 166-168, 177, 433. 

\quad, 94, 166-167, 185, 232-233, 352 . 

quad left, see flush left. 

quad middle, see \break. 

quad right, see flush right. 

quarter circles, 389-391. 

question mark, 51, 73, 161. 

Quick, Jonathan Horatio, 298. 
quotation marks, 3-5, 24, 394-395. 
quotations, 100, 418-419, 425. 
quotes within quotes, 5. 

IR, see blackboard bold. 

Rad atom, 158, 289, 443. 

*\radical, 157-159, 291, 443. 
ragged bottom margins, 111, 253, 256, 398. 
ragged left margins, 392-394. 
ragged right margins, 29-30, 101, 261-262, 
355-356, 392-394. 

\raggedbottom, 111, 253, 363 . 406. 
\raggedcenter, 107. 

\raggedright, 29-30, 76, 101, 107, 115, 
262, 356, 396, 407. 

*\raise, 66-67, 80, 151, 179, 193, 285, 

290 . 408. 

\rangle ( ) ), H6-147, 150, 359, 437. 
Raper, Matthew, 61. 

\rbrace ( } ), 146-147, 359, 437. 

\rbrack ( ] ), 14 6-147, 351, 369, 437. 
\rceil ( ] ), 146-147, 359, 437. 

\Re ( 5R ), 435. 

*\read, 215, 217-218, 276, 346, 401. 
recent contributions, 112, 125, 281. 
recovery from errors, 30-34, 295-303. 
recursion, 219, 268, 301, 379, 391, 397. 
infinite, 299, 301, 383. 


reduction, 16. 

reference marks, 116-117. 

reference point of a box, 63-64, 77, 

80-82, 388-389. 

(registerdef), 277 . 

registers, 117-122, 212, 214, 346-348. 
registration marks, 416. 

Reid, Brian Keith, 371, 404-406. 

Rel atom, 158, 170-171, 289, 442-444, 
446-447. 

(relation), 209 . 

relations, 133-134, 147, 154-155, 358, 436 . 
437; see also Rel atom. 

*\relax, 23, 25, 71, 240, 276, 279, 307, 353. 
\relbar ( — ), 358 . 

\Relbar ( = ), 358 . 

*\relpenalty, 101, 174, 272, 322 , 348, 446 . 
\removelastskip, 353 . 

\repeat, 217-219, 352 . 
repeating commands, see \loop. 
reserved characters, 37-38, 51-52, 134. 
reserved words, 61_, 71, 268 , 337, 370. 
restricted horizontal mode, 85, 87-89, 
285-287. 
resume, 253. 

(return) (ASCII code 13), 23, 39, 43, 45, 46, 
48, 249, 331, 343, 345, 369, 380. 

\( return), 8, 305, 351 . 
reverse apostrophe, 3-5, 51, 132, 134, 391, 
394-395; see also alphabetic constant, 
reverse slash, see backslash. 

Reviewer, Ann Arbor, 106. 

\rfloor £ j ), 146-147, 359, 437. 

\rgroup, 150, 176, 359, 437. 

\rho ( p ), 128, 325, 434. 

\rhook, 358 . 

*\right, 148-150, 155-157, 171, 196, 

292 . 437. 

right brace, 13-14, 19-21, 38, 51, 200-202, 
203-204, 205-206, 269, 275-276, 

279 , 301, 330. 

(right brace), 275 . 

right bracket, 51, 134, 146-147, 171, 

345, 408, 437. 

right delimiters, see openings, 
right-hand pages, 252-253, .(16. 
right justification, 71. 
right parenthesis, 51, 134, 140, 145-150, 
345. 

right quote, 3-5, 51, 130, 155, 201, 305, 

324, 394-395; see also octal. 
\rightarrow ( — t ), 226, 437. 

\Rightarrou ( => ), 226, 437. 
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\rightarrowf ill ( l), 226, 357 . 

\rightharpoondown ( —r ), 437. 
\rightharpoonup ( — ^ ), 437. 
*\righthyphenmin, 273, 364, 354 , 455. 
\rightleftharpoons ( ), 360—361 , 437. 

\rightline, 101, 31 7, 340-341, 353 . 
*\rightskip, 100-101, 274, 317, 356, 

393, 421. 

\rlap, 82-83, 189, 24 7, 319, 353 , 389, 4 1 6. 
\rm (use roman type), 13-15, 154, 163, 320, 
351 , 364, 409, 414-415, 419, 427. 
\rmoustache, 150, 359 . 
roman letters in math, 162-164. 
roman numerals, 40-41, 252, 256. 
uppercase, 374. 

roman parentheses in italic text, 409-411. 
roman type, 13-17, 51-55, 127, 

162-165, 427. 

*\romannumeral, 40—41, 213, 214, 252. 

Root, Waverley Lewis, 55. 

\root, 130-131, 179, 325, 360 . 
rounding, 58, 119. 

\rq ( ’ ), 5, 48, 351, 369, 395. 
rule boxes, 24, 64, 221-225, 281-282, 

285, 291. 

(rule dimension), 282 . 

(rule specification), 282 . 
rule thickness, 143, 179, 221, 447. 
ruled tables, 245-248, 392. 
rulers, 58. 

runaways, 205-206, 297. 

running headlines, 253, 258-260, 416. 

running the program, 23-35. 

Russian characters, 370. 

\S ( § ), 53, 117, 356, 438-439. 

Sally, Baby, 72-76. 
sample line for tabbing, 232-234. 
sample pages for book design, 412-413. 
save size, 300-301, 374, 399-400. 
save stack buildup, 301, 346. 

\sb, 135, 357, 369. 

\sc (use SMALL CAPS type), 414-415. 
scaled, 16-17, 60, 277, 350, 433. 
scaled points, 57-58, 270. 

Scandinavian letters, 45-46, 52-53. 
Schickele, Prof. Peter, 410-411, 481. 
scratch control sequence, 352. 
scratch registers, 122, 346. 

Scribe, 371, 404. 
script letters, 164, 434. 
script size, 140, 153, 442. 
script style, 140-142, 441-447. 


*\scriptf ont, 153, 168, 213, 271, 321, 

351, 414-415, 441-442. 
scriptscript size, 140, 145, 153, 442. 
scxiptscript style, 140-142, 441-447. 
*\scriptscriptf ont, 153, 168, 213, 271, 
351, 414-415, 441-442. 
*\scriptscriptstyle, 141—142, 179, 292. 
*\scriptspace, 274, 348, 445-446. 
*\scriptstyle, 141-142, I 45 , 179, 292. 
*\scrollmode, 32, 277 . 

\searrow ( \, ), 437. 

\sec ( sec ), 162, 361. 
section number sign, see \S. 

Selden, John, 11. 
selection, see \cases. 
semicolon, 51, 134, 161- 
sentences, 72-76. 

Sesame Street, 61. 

set macro, 379. 

set notation, 147, 174-175. 

*\setbox, 66-67, 77, 81, 120, 276, 386-392. 
*\set language, 287, 455 . 

\setminus ( \ ), 406. 

\settabs, 231-234, 354, 355. 
setting the glue, 70, 77, 81, 388. 
\setupverbatim, 380—381, 391. 

\sevenrm, 15, 153, 350 , 351, 

*\sf code, 76, 214, 271, 286, 321, 345, 351. 
shadow boxes, 66. 

Shakespeare, William, 17, 55, 337. 

(shape assignment), 277 . 

(shape dimensions), 277 . 

\sharp ( j) ), 409, 435. 
sharp S ( fi ), see \ss. 
sharp sign, see hash mark. 

Shaw, George Bernard, 107, 229. 

Sheridan, Richard Brinsley Butler, 265. 
shifted output, see \hoffset, \voffset. 
shilling sign, see slash. 

*\shipout, 227, 253-254, 279, 300, 302. 
shortest paths, 99. 

(shorthand definition), 277 . 

*\show, 10, 215, 279, 299. 

*\showbox, 66-67, 121, 234, 279. 
*\shouboxbreadth, 273, 302 , 303, 348. 
*\showboxdepth, 79, 273, 302 , 303, 348. 

\showhyphens, 364 , .{52. 

*\showlists, 88-89, 95, 112, 125, 158-159, 
279, 293. 

*\showthe, 121, 215, 279. 
shriek, see exclamation point. 

(shrink), 271 . 

shrinkability, 69-71, 75, 409. 
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side conditions, 166-16 7, 185-186. 

\sigma ( a ), 195-196 , 434. 

\Sigma f S ), 165, 434, 435. 
sigma signs, see \sum. 

\signed, 106. 

\sim ( - ), 133, 436. 

\simeq ( ~ ), 133, 436. 

(simple assignment), 276 . 

\sin ( sin ), 162, 361. 

\sinh ( sinh ), 162, 361. 
sinkage, 413. 

size switching, 15, 408, 414-415. 
sizes of type for mathematics, 140, 153. 
\skew, 136, 359 . 

*\skewchar, 214, 271, 273, 277, 351, 

4U, 430, 431, 443. 

*\skip registers, 118-122, 271, 276, 346-347, 
349, 352, 363, 394. 

*\skipdef, 119 , 215, 277, 346-34 7. 

(skipdef token), 271 . 
skipping space, see glue. 

\sl (use slanted type), 13-15, 165, 351 , 

409, 414-415, 419. 
slant of a font, 375, 433. 
slanted type, 13, 63-64, 100. 
slash, 51, 132, 146-147, 320, 430. 

\slash (/ with break allowed), 93, 353 . 
slashed form of fractions, 139-140. 

Slavic hacek accent, see \v. 

\slfam, 351 , 
slides, 59. 

slurs, see \smile, \frown. 
small caps, 414. 

\smallbreak, 111, 353 . 421. 

\smallint ( / ), 358 , 435. 

\smallskip, 70, 78, 100, 109, 111, 181, 
340-341, 352, 355, 4 1 0-4 1 2. 
\smallskipamoiint, 349, 352—353. 
\smalltype, 408—411. 

\smash, 131, 178, 327, 360 . 

\smile ( ^ ), 436. 

solidus, see slash. 

sophisticated spacing, 74. 

sp (scaled point), 57, 118-119, 270, 398, 400. 

\sp (superscript), 135, 357 , 369. 

(space) (ASCII code 32), 39, 43, 46, 343. 

\( space), see \ u (near the beginning). 

\space, 254, 272, 306, 351, 380, 406. 
space after a constant, 208, 272. 
space factor, 76, 285-287, 306, 380, 395. 
(space token), 268, 269 , 282, 285, 290, 
376-377. 

*\spacef actor, 76, 271, 285, 363, 433. 


spaces, 3, 5, 8-9, 19, 37, 40, 47-48, 127, 204, 
232, 272, 297, 299, 319, 336. 
as active characters, 254, 380-381. 
*\spaceskip, 76, 274, 317, 356, 429. 
spacing, see glue. 

in formulas, 162, 167-171. 

\spadesuit ( A ), 358 . 435. 

*\span, 215, 238, 243, 244, 245, 248, 

249, 282, 330, 385. 

Spanish ligatures, 51, 381, 427. 
spanned columns in tables, 243-245. 
spanned rows in tables, 249. 

*\special, 216, 226, 228-229. 280. 
special characters, 37-38, 43-46, 51-52, 

134, 367-371. 

(special dimen), 271, 277. 

(special integer), 271 , 277. 

special symbols for math, 128, 434-438. 

Spivak, Michael David, 137. 

"/, split, 124. 

split insertion penalty, 124. 

*\splitbotmark, 213, 259 , 280. 

*\splitf irstmark, 213, 259 . 280. 
*\splitmaxdepth, 124, 274, 281, 348, 

363, 41 7. 

*\splittopskip, 124, 274, 281, 348, 

363, 397, 41 7. 
spread, 77, 222, 238, 278. 
springs, 70. 

\sqcap ( n ), 133, 436. 

\sqcup ( U ), 133, 436. 

\sqrt, 130-131, 141 , 145, 157, 169-170, 

360 , 443 . 

\sqsubseteq ( [7 ), 436. 

\sqsupseteq ( □ ), 436. 

\square ( □ ), 151. 
square brackets, see brackets, 
square bullet ( ■ ), 420 . 
square root, see \sqrt. 
squeeze routine, 188. 
squiggle accent ('), see \~ , \tilde, 
\widetilde. 
squint print, 59. 

\ss ( fi ), 52, 356 . 

stack positions, 300-301, 374, 399-400. 
stacked fractions, see \over. 

\star ( * ), 436; cf. asterisk, 
states, 46-48. 
sterling, 54, 428. 

Stevenson, Robert Louis Balfour, 67. 

Stirling numbers, see \brace, \brack. 
stomach, see anatomy of T^X. 
stopping TgX, see \end. 
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story.tex, 24, 26, 30-31. 

Stout, Rex Todhunter, 401. 

(stretch), 271 . 

stretchability, 69-71, 75, 409. 

*\string, 40-41, 213-214, 215, 3J,8 , 377. 
\strut, 82, 142, 178, 240, 246-247, 316, 

329, 333, 353, 396, 400, 421. 

\strutbox, 316, 353 , 396, 414~{15. 
struts, 82, 125, 131, 142, 178, 245-247, 

255, 329, 416, 422, 423. 
style change items, 157, 442. 
style-independent documents, 194, 203. 
styles of math formatting, 140-141, 441-447. 
styles of math writing, 166-167, 182-183. 
subformulas, 129, 171, 173. 

(subscript), 291 . 

subscripts, 15, 128-131, 133, 150, 158-159, 
163, 179, 289-291, 343. 

\subset ( C ), 133, 436. 

\subseteq ( C ), 133 , 436. 

\succ ( >- ), 436. 

\succeq ( A ), 436. 

\sum (large S), 139, 144-145, 148-149, 

358 , 432-433, 435. 

\sum\ 181, 324-325. 
summary of plain T^X, 339-342. 
summation, see \sum. 

\sup ( sup ), 162, 361 . 

\superej ect, 116, 254, 256, 257, 353, 407. 
superiors, see superscripts. 

(superscript), 291 . 

superscripts, 128-131, 133, 150, 158-159, 
179, 289-291, 343. 
suppressed-L, 52-53. 

\supset ( D ), 436. 

\supseteq ( D ), 436. 

\surd ( \J ), 358 , 435. 

surd signs, see \radical, \sqrt. 

Swanson, Ellen Esther, 197. 

\swarrow ( ), 437. 

Swedish characters, 52-53, 370. 

Sweelinck, Jan Pieterszoon, 239. 
Swierczkowski, Stanislaw Slawomir, 53. 

Swift, Jonathan, 5. 

symbol fonts, 157, 351, 431, 433, 441, 447. 
symbolic box format, 66, 75, 79, 158-159, 
302, 455. 

symbols in math, table, 434-438. 
syntax rules, 268-269. 

System Development Foundation, vii. 

Szego, Gabor, 7. 


\t (tie-after accent), 52-53, 356 . 

(tab) (ASCII code 9), 8, 45, 343, 369, 391. 
\(tab) , 8, 351. 

\tabalign, 354 . 355. 
tabbing, 231-234, 339, 354 . 

compared to \halign, 235. 
tables, see alignments, tabbing, 
tables of contents, 120, 226. 
tables of T^X trivia: 

accents (non-math), 52, 339. 

atomic types, 158. 

category codes, 37. 

character codes, 367-369. 

default values, 343-345, 348-349. 

\fontdimen parameters, 433, 447. 

keywords, 61. 

math accents, 135. 

math classes, 154. 

math spacing, 170-171. 

math symbols, 434-438. 

parameters, 272-275. 

units of measure, 57. 

\tabs, 234, 354 . 

*\tabskip, 215, 237-239 . 244, W, 274, 

282, 285, 354. 

tabskip glue, 237-239, 245, 302-303, 392. 
Tacitus, Publius Cornelius, 219. 
tags, see equation numbers, 
tail recursion, 219. 

\tan ( tan ), 162, 361. 

\tanh ( tanh ), 162, 361 . 

\tau ( r ), 1, 434. 

Tchaikovsky, see Chaikovskil. 
telephone numbers, 408. 
templates, 235-236, 240-243. 

\tenex, 350 , 351, 361, 414~415. 

\tenpoint, 15, 414 , 418- 

\tenrm, 15, 27-28, 45, 153, 252, 335, 

350 , 351, 414. 

\tensl, 15, 350 . 351, 414- 
tensor notation, 130, 169. 

\tensy, 179, 350, 351, 414. 
terminal, input from, 217-218. 

output to, 217-218, 226-228, 279-280. 
TEX, 1. 

l^X, bad puns on, 1, 11, 63, 153, 161, 225, 
229, 262, 305, 340-341, 373 . 
pronunciation of, 1. 

TeX, 1. 

\TeX ( TeX ), 8-10, 19, 66-67, 204, 225, 
340-341, 356 , 418, 419. 

.tex, 25, 217, 226. 

TeX capacity exceeded, 300—301. 
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TJ^X Grandmasters, 253. 

TJ^X Users Group, vii, 408, 483. 

T^XTS, vii. 
texput, 23. 

text between aligned displays, 193. 
text size, 140, 153, 442. 
text style, 140-142, 441-447. 

*\textfont, 153, 168, 188, 213, 271, 351, 
414-415 , 441-442. 

\textindent, 117, 355 . 

*\textstyle, 141—142, 292 . 326. 

.tfm (font metric files), 401, 433, 442. 

*\the, 214-215. 216, 373, 375, 422. 
theorems, 111, 202-203, 340-341, 355. 
\theta ( 0 ), 128, 162, 325, 434. 

\Theta ( 0 ), 434. 

*\thickmuskip (thick math space), 167-168, 
274 , 349, 446. 

thin spaces, 161, 167-173, 305, 409. 
*\thinmuskip (thin math space), 167-168, 
274 , 349, 446. 

\thinspace, 5, 10, 305, 311, 352 , 409. 

This can’t happen, 299. 

Thor, Arthur Uther, 24, 117, 200, 253. 

three-column output, 257, 396-398. 

three dots, see ellipses. 

tie-after accent, see \t. 

ties, 25, 73-74, 91-93, 104, 161, 167, 

173, 353, 404. 
tight lines, 97, 99, 302. 
tilde, 38, 51, 91-92, see ties. 

\tilde (math accent: x), 135, 164. 
tilde accent (~), see \~ , \tilde, \widetilde. 
*\time, 273, 349. 

\times ( X ), 133, 436. 

Times Tribune, Peninsula, 455. 

Titelouze, Jehan, 239. 

to, 77, 217, 222, 238, 276, 278. 

\to, 134, 361, 438. 

\today, 406. 

token list parameters and registers, 

212, 215, 275, 373. 

token lists, as displayed by T)?X, 228, 

329, 382. 

(token parameter), 275 . 

(token variable), 212, 276 . 
tokens, 38-41, 46-48, 203-207. 

*\toks, 212, 215, 262, 276. 

*\toksdef, 212, 215, 277, 347, 378. 

(toksdef token), 271. 

*\tolerance, 29-30, 91, 94, 96, 107, 272, 

31 7, 333, 342, 348, 364 , 451- 
\top ( T ), 435. 


\topglue, 340, 352 . 

\topins, 256, 363 , 36.(. 

\topinsert, 115-116, 251, 363 . 
ntopmark, 213, 258, 259-260, 280. 
*\topskip, 113-114. 124, 256, 274, 348. 
Trabb Pardo, Luis Isidoro, 92. 

\tracingall, 121, 303, 364 . 
*\tracingconunands, 88—89, 212 , 273, 299. 
*\tracinglostchars, 273, 301 , 3.(8, 401. 
*\tracingmacros, 205 , 212, 273, 329. 
*\tracingonline, 121, 212, 273, 303 . 
*\tracingoutput, 254 . 273, 301-302. 
*\tracingpages, 112-114, 124, 273, 303 . 
*\tracingparagraphs, 98-99, 273, 303 . 
*\tracingrestores, 273, 301 , 303 . 
*\tracingstats, 273, 300 , 303 , 383. 
transcript, see log file, 
transparencies, 59. 

\triangle ( A ), 435. 

\trianglelef t ( <1 ), 436. 

\triangleright ( > ), 436. 
triangular quotation, 101-102. 
tricky macros, 41, 261-263, 354, 360, 

361, 373-401, 421. 
true, 59-60, 270, 407-408. 
truth, vi, 267. 

Tschichold, Jan, 83. 

\tt (use typewriter type), 13, 53, 113, 165, 
340-341, 351, 380-382, 414-4 1 5, 129 . 
\ttfam, 351 , f/f .(15. 

\ttglue, 414-415, 421. 

\ttraggedright, 356 . 

TUGboat, vii, 425, 483. 
turtle commands, 390-391. 

Twain, Mark (= Clemens, Samuel 
Langhorne), 83, 451. 
two-column format, 257, 386-388, 41 7. 
two-line displays, 196. 
type size switching, 15, 408, 414-415. 
typefaces, 13, 17, 427. 
typewriter type, 13, 53, 165, 356, 

420-421, 429. 

\u (breve accent), 52-53, 356 . 

*\uccode, 41, 214, 271, 345, 348, 377, 394 • 
*\uchyph, 273, 348, 454 . 
umlaut accent ("), see \", \ddot. 
unbreakable spaces, see ties. 
\uncatcodespecials, 380. 

\undefined, 384. 

undelimited parameters, 203-204, 376-377. 
Under atom, 158, 289, 443. 

\underbar, 2.(4', 323, 353 . 

\underbrace, 176, 225-226, 359. 
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underfull box, 29, 94, 238, 302-303, 

397, 400, 417, 452. 

*\underline, 130-131, 141, 291, 443 . 
underlined text, 178; see also \underbar. 
underscore (the character “_’), 38, 51, 
128-130, 134, 165. 

*\unhbox, 120, 283, 285, 35.{, 356, 361, 399. 
*\unhcopy, 120, 283, 285 , 353. 

(unit of measure), 268, 270 . 
units of measure, 57-61, 270. 
in formulas, 169. 
table, 57. 

*\unkern, 280 . 
unmatched left brace, 216; 
see also runaways. 

*\unpenalty, 280 . 
unset box, 240, 302-303. 

(unsigned dimen), 270 . 

(unsigned mudimen), 270 . 

(unsigned number), 269 . 

*\unskip, 222-223, 280, 286, 313 , 392, 
418-419. 

*\unvbox, 120, 254, 282, 286, 354, 361, 

363, 364, 392, 399, 4 1 7. 

*\unvcopy, 120, 282 . 286, 361. 

\up, 408. 

\uparrow ( f ), 1 46-147, 182, 359, 437. 
\Uparrow ( f|- ), 1 46-147, 359, 437. 
\upbracef ill (s v 1) 225-226, 357 . 
\updownarrou ( ^ ), 146-147, 359 , 437. 
\Updownarrow ( ), 146-147, 359 , 437. 

\uplus ( ttl ), 436. 

*\uppercase, 41, 215, 217, 279, 307, 345, 
3.(8, 374, 377, 394. 
uppercase letters, 9, 64, 268, 370. 

Greek, 127, 164, 434. 
roman numerals, 374. 

\upsilon ( v ), 434, 483. 

\Upsilon ( T ), 434. 

Ursa Major, 389. 

User, Ben Lee, 74, 80, 100, 140, 164, 186, 
187, 193, 221, 296, 410-411. 

\v (check accent), 52, 356 . 

*\vadjust, 95, 105, 109, 110, 117, 259, 

281 , 317, 393, 454. 

*\valign, 249, 283, 285-286, 302, 335, 397. 
Vallee Poussin, Charles Louis Xavier 
Joseph de la, 92. 

van der Waerden, Bartel Leendert, 92. 
van Leunen, Mary-Claire, 425. 

\varepsilon ( £ ), 128, 434. 

(variable assignment), 276 . 
variable family, 154, 289, 358-359. 


variables in formulas, 132, 358, 434-435; 

see also Old atom. 

\varphi ( <p ), 128, 147, 434. 

\varpi ( vj ), 434. 

\varrho ( Q ), 128, 434. 

\var sigma ( c ), 434, 483. 

\vartheta ( 6 ), 128, 434. 

*\vbadness, 272, 348, 397, 417. 

vbox (box with vertical list inside), 65. 
*\vbox, 65, 80-82, 103, 151, 193, 222, 

278, 388-389. 

Vcent atom, 158, 290, 443. 

*\vcenter, 150-151, 159, 170, 193, 222, 

242, 290, 361, 443 . 

\vdash ( h ), 436. 

\vdots ( : ), 177, 359 . 

\vec (math accent: ,c), 135; 

see also \ overright arrow, 
vectors, 177, 199-201; see also \vec. 

\vee ( V ), 133, 436. 
verbatim copying, 422-423. 
verbatim listing, 48, 380-382, 391, 420-421. 
\vert ( I ), 1.(6-147, 150, 359, 438; cf. \mid. 
Wert ( || ), 117, 146-147, 150, 359, 486. 
(vertical command), 286 . 
vertical line (the character ‘I’), 52, 53, 132, 
146-147, 171, 174, 438; 
see also vertical rules, 
vertical lists, 64, 110 . 
vertical mode, 85-89, 267-283 . 

(vertical mode material), 278. 

(vertical rule), 281 . 

vertical rules, 64, 151, 221-226, 285 . 

291 , 392. 

(vertical skip), 281 . 
very loose lines, 97, 99. 

*\vf il, 71, 72, 111, 256, 281, 286, 417. 

*\vf ill, 24, 25, 71, 72, 256-257, 281, 286. 
\vf illl, 72. 

*\vf ilneg, 72, 111, 281, 286. 

\vfootnote, 117, 363 . 

*\vfuzz, 274 , 348. 

\vglue, 352 , 40 8. 
vinculum, see \overline. 
virgule, see slash, 51. 
visible space, 3, 420 , 429. 

*\vof f set, 251, 252-253, 274, 342, 406. 
void, 120, 210. 

WoidbSx, 347 , 348. 

Voltaire, de (= Arouet, 

Prangois Marie), 447. 

\vphantom, 178-179, 211, 321, 360- 
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*\vrule, 64, 86, 151, 221-222, 224, 245-24 7, 
281-282, 283, 357, 392, 420. 

*\vsize, 113-114, 251, 253, 255, 274, 

340-341, 348, 400, 4 06, 413 , 4 1 5, 4 1 7. 
*\vskip, 24, 71, 85, 191, 281, 286. 

*\vsplit, 124, 222, 259, 278, 397, 417. 
*\vss, 71, 72, 255, 281, 286. 

*\vtop, 81-82, 151, 222, 278, 333. 

Walter’s worksheet, 244. 

Walton, Izaak, 67. 

Watson, John Hamish, M.D., 401. 

*\wd, 120, 271, 388-389, 391, 417. 

Wedge ( A ), 133, 436. 

Weierstrass p, see \wp. 
weird error, 298. 

Welsh, 239, 451. 

Westing, Arthur Herbert, 246. 
whatsits, 95, 110, 157, 226-229. 

\widehat (math accent: x), 136, 359 , 433. 
\widetilde (math accent: x), 136, 359 . 
widow lines, 104, 272, 398. 
widow words, 104. 

*\widowpenalty, 104, 113 , 272, 348. 
width, 221, 224, 282, 337. 
width of a box, 63-67, 77, 80-82, 225. 
wiggle, see \sim. 

Winograd, Terry Allen, 425. 

Wischmeyer Ehlert, Clara Louise, 248. 
uithdelims, 152, 156. 


\wlog, 347 . 

Wolfe [Holmes], Nero, 401. 

\wp ( p ), 435. 

\wr ( l ), 436. 

wreath product, see \wr. 

*\write, 215, 216, 226-228, 254, 280, 

346, 377, 422, 424. 

X, see \times, \*. 
x-height, 54, 60, 433, 443, 445-447. 
*\xdef, 215-216, 275, 373, 4I8, 424. 
Xenophon, 239. 

\xi ( | ), 434, 447. 

\Xi ( S ), 434. 

*\xleaders, 224 . 

*\xspaceskip, 76, 274, 317, 356, 429, 433. 
\xyzzy, 379. 

*\year, 41, 273, 349, 406. 

Yellin, Prank Nathan, 401. 

\z@, 347, 348. 

\z@skip, 347 , 348. 

\zeta ( C ), 434. 

1 / 2 , 67, 332. 

1/2, in unslashed form, 141, 186. 

(4-bit number), 271 . 

(8-bit number), 271, 276-278. 

(15-bit number), 271, 277, 289, 291. 
(27-bit number), 271, 289, 291. 


HIGHTON 

SCHICKELE 

Bach 


Important works such as histories, biographies, 
scientific and technical text-books, etc., should contain indexes. 
Indeed, such works are scarcely to be considered complete without indexes. 

An index is almost invariably placed at the end of a volume 
and is set in smaller type than the text-matter. 
Its subjects should be thoroughly alphabetized. 
The compiling of an index is interesting work, though 
some authors are apt to find it tedious and delegate the work to others. 
The proofreader who undertakes it will find that it is splendid mental exercise 

and brings out his latent editorial capability. 
— ALBERT H. HIGHTON, Practical Proofreading (1926) 

Important references are given in boldface. 
Italicized numbers indicate fleeting references, 
whereas numbers in parentheses refer to 
mere implications or unwarranted extrapolations. 
Asterisks are used to identify particularly distasteful passages. 
— PROF. PETER SCHICKELE, The Definitive Biography of P. D. Q. Bach (1976) 
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This appendix is about grouping of another kind: Tj^X users from around the 
world have banded together to form the TgX Users Group (TUG), in order to 
exchange information about common problems and solutions. 

A newsletter/journal called TUGboat has been published since 1980, 
featuring articles about all aspects of TgX and METRFONT. TUG has a network 
of “site coordinators” who serve as focal points of communication for people with 
the same computer configurations. Occasional short courses are given in order 
to provide concentrated training in special topics; videotapes of these courses are 
available for rental. Meetings of the entire TUG membership are held at least 
once a year. You can buy TgX T-shirts at these meetings. 

Information about membership in TUG and subscription to TUGboat 
is available from 

{\obeylines 
\TeX\ Users Group 
P.0. Box 9506 

Providence RI 02940\kern.05em-9506, USA. 

} 

Don’t delay, write today! That number again is 
TgX Users Group 
P.O. Box 9506 

Providence RI 02940-9506, USA. 


METAFONT 

HORNSCHUCH 

upsilon 

varsigma 

BEETON 


[The printer] should refuse to employ wandering men, 
foreigners who, after having committed some grievous error, 
can easily disappear and return to their own country. 

— HIERONYMUS HORNSCHUCH, ’ OpQoTVKO^pa<t>ia<; (1608) 

An author writing an article for publication in TUGboat 
is encouraged to create it on a computer file and submit it on magnetic tape. 
— BARBARA BEETON, \title How to Prepare a File \ cr 

For Publication in TUGboat\cr (1981) 



